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PREFACE 


My main purpose in this book is to present a unified treatment 
of that part of measure theory which in recent years has shown 
itself to be most useful for its applications in modern analysis. 
If I have accomplished my purpose, then the book should be 
found usable both as a text for students and as a source of refer- 
ence for the more advanced mathematician. 

I have tried to keep to a minimum the amount of new and 
unusual terminology and notation. In the few places where my 
nomenclature differs from that in the existing literature of meas- 
ure theory, I was motivated by an attempt to harmonize with 
the usage of other parts of mathematics. There are, for instance, 
sound algebraic reasons for using the terms “lattice” and “ring” 
for certain classes of sets—reasons which are more cogent than 
the similarities that caused Hausdorff to use ‘‘ring” and ‘“‘field.” 

The only necessary prerequisite for an intelligent reading of 
the first seven chapters of this book is what is known in the 
United States as undergraduate algebra and analysis. For the 
convenience of the reader, § 0 is devoted to a detailed listing of 
exactly what knowledge is assumed in the various chapters. The 
beginner should be warned that some of the words and symbols 
in the latter part of § 0 are defined only later, in the first seven 
chapters of the text, and that, accordingly, he should not be dis- 
couraged if, on first reading of § 0, he finds that he does not have 
the prerequisites for reading the prerequisites. 

At the end of almost every section there is a set of exercises 
which appear sometimes as questions but more usually as asser- 
tions that the reader is invited to prove. These exercises should 


be viewed as corollaries to and sidelights on the results more 
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formally expounded. They constitute an integral part of the 
book; among them appear not only most of the examples and 
counter examples necessary for understanding the theory, but 
also definitions of new concepts and, occasionally, entire theories 
that not long ago were still subjects of research. 

It might appear inconsistent that, in the text, many elementary 
notions are treated in great detail, while, in the exercises, some quite 
refined and profound matters (topological spaces, transfinite num- 
bers, Banach spaces, etc.) are assumed to be known. The mate- 
rial is arranged, however, so that when a beginning student comes 
to an exercise which uses terms not defined in this book he may 
simply omit it without loss of continuity. The more advanced 
reader, on the other hand, might be pleased at the interplay 
between measure theory and other parts of mathematics which 
it is the purpose of such exercises to exhibit. 

The symbol §j is used throughout the entire book in place of 
such phrases as “Q.E.D.” or “This completes the proof of the 
theorem”’ to signal the end of a proof. 

At the end of the book there is a short list of references and a 
bibliography. I make no claims of completeness for these lists. 
Their purpose is sometimes to mention background reading, 
rarely (in cases where the history of the subject is not too well 
known) to give credit for original discoveries, and most often to 
indicate directions for further study. 

A symbol such as u.v, where w is an integer and v is an integer 
or a letter of the alphabet, refers to the (unique) theorem, formula, 
or exercise in section uw which bears the label v. 
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§0. PREREQUISITES 


The only prerequisite for reading and understanding the first 
seven chapters of this book is a knowledge of elementary algebra 
and analysis. Specifically it is assumed that the reader is familiar 
with the concepts and results listed in (1)-(7) below. 

(1) Mathematical induction, commutativity and associativity 
of algebraic operations, linear combinations, equivalence relations 
and decompositions into equivalence classes. 

(2) Countable sets; the union of countably many countable 
sets is countable. 

(3) Real numbers, elementary metric and topological properties 
of the real line (e.g. the rational numbers are dense, every open 
set is a countable union of disjoint open intervals), the Heine—- 
Borel theorem. 

(4) The general concept of a function and, in particular, of a 
sequence (i.e. a function whose domain of definition is the set of 
positive integers); sums, products, constant multiples, and abso- 
lute values of functions. 

(5) Least upper and greatest lower bounds (called suprema and 
infima) of sets of real numbers and real valued functions; limits, 
superior limits, and inferior limits of sequences of real numbers 
and real valued functions. 

(6) The symbols ++ and —», and the following algebraic rela- 
tions among them and real numbers «x: 


(00) + (40) = x + (40) = (40) + % = +0; 

to ifx>0, 
(0) = (+00)x = 10 if x = 0, 
Fo ifx <0; 

(+0)(0) = +o, 

(+00)(Fo) = —o; 

x/(0) = 0; 
—~a~<i x < +o, 
1 
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The phrase extended real number refers to a real number or one 
of the symbols +. 
(7) If x and y are real numbers, 


x Uy = max {xy} = F@+y4+|«-y)), 
xy =min {x,y} = $@+y —|x-y|). 


Similarly, if f and g are real valued functions, then f U g and 
ff g are the functions defined by 


FU g)(x) = f(*) U ge) and (fN g)(*) = f(x) A g(x), 


respectively. The supremum and infimum of a sequence {xn} 
of real numbers are denoted by 


Ua %, and Obs Xny 
respectively. In this notation 
lim supa Xn = (\nu: Unan Xm 


lim inf, xn = Ura: (nun %m- 


In Chapter VIII the concept of metric space is used, together 
with such related concepts as completeness and separability for 
metric spaces, and uniform continuity of functions on metric 
spaces. In Chapter VIII use is made also of such slightly more 
sophisticated concepts of real analysis as one-sided continuity. 

In the last section of Chapter IX, Tychonoff’s theorem on the 
compactness of product spaces is needed (for countably many 
factors each of which is an interval). 

In general, each chapter makes free use of all preceding chap- 
ters; the only major exception to this is that Chapter IX is not 
needed for the last three chapters. 

In Chapters X, XI, and XII systematic use is made of many 
of the concepts and results of point set topology and the elements 
of topological group theory. We append below a list of all the 
relevant definitions and theorems. The purpose of this list is not 
to serve as a text on topology, but (a) to tell the expert exactly 


and 
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which forms of the relevant concepts and results we need, (b) to 
tell the beginner with exactly which concepts and results he should 
familiarize himself before studying the last three chapters, (c) to 
put on record certain, not universally used, terminological con- 
ventions, and (d) to serve as an easily available reference for 
things which the reader may wish to recall. 


Topological Spaces 


A topological space is a set X and a class of subsets of X, called 
the open sets of X, such that the class contains 0 and X and is 
closed under the formation of finite intersections and arbitrary 
(i.e. not necessarily finite or countable) unions. A subset E 
of X is called a G; if there exists a sequence {U,} of open sets 
such that E = ()%_, Un. The class of all Gs’s is closed under the 
formation of finite unions and countable intersections. The topo- 
logical space X is discrete if every subset of X is open, or, equiva- 
lently, if every one-point subset of X is open. A set E is closed 
if X — Eis open. The class of closed sets contains 0 and X and 
is closed under the formation of finite unions and arbitrary inter- 
sections. The interior, E°, of a subset E of X is the greatest open 
set contained in E; the closure, £, of E is the least closed set con- 
taining E. Interiors are open sets and closures are closed sets; 
if E is open, then E° = E, and, if E is closed, then E = E. The 
closure of a set E is the set of all points x such that, for every open 
set U containing x, EN U0. Aset Eis dense in Xif E = X. 
A subset Y of a topological space becomes a topological space 
(a subspace of X) in the relative topology if exactly those subsets 
of Y are called open which may be obtained by intersecting an 
open subset of X with Y. A neighborhood of a point x in X 
[or of a subset E of X] is an open set containing x [or an open set 
containing E]. A base is a class B of open sets such that, for 
every x in X and every neighborhood U of x, there exists a set 
B in B such that xe BC U. The topology of the real line is 
determined by the requirement that the class of all open intervals 
be a base. A subbase is a class of sets, the class of all finite inter- 
sections of which is a base. A space X is separable if it has a 
countable base. A subspace of a separable space is separable. 
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An open covering of a subset E of a topological space X is a 
class K of open sets such that Ec UK. If X is separable and 
K is an open covering of a subset E of X, then there exists a 
countable subclass { Ki, Ka, ---} of K which is an open covering 
of E. Aset Ein X is compact if, for every open covering K of E, 
there exists a finite subclass { Ky, ---, Kn} of K which is an open 
covering of E. A class K of sets has the finite intersection prop- 
erty if every finite subclass of K has a non empty intersection. 
A space X is compact if and only if every class of closed sets with 
the finite intersection property has a non empty intersection. A 
set E in a space X is o—compact if there exists a sequence {C,} 
of compact sets such that E = U%.1C,. A space X is locally 
compact if every point of X has a neighborhood whose closure is 
compact. A subset E of a locally compact space is bounded if 
there exists a compact set C such that ECC. The class of all 
bounded open sets in a locally compact space is a base. A closed 
subset of a bounded set is compact. A subset E of a locally com- 
pact space is o—bounded if there exists a sequence {C,,} of compact 
sets such that E c U%_,C,. To any locally compact but not 
compact topological space X there corresponds a compact space 
X* containing X and exactly one additional point «*; X* is called 
the one-point compactification of X by x*. The open sets of X* 
are the open subsets of X and the complements (in X*) of the 
closed compact subsets of X. 

If {X;: ie I} is a class of topological spaces, their Cartesian 
product is the set X = XK {X;: ie J} of all functions x defined 
on I and such that, for each 7 in J, x(#) e X;. For a fixed ig in 
I, let Fy be an open subset of X;,,, and, for i ¥ io, write E; = Xj; 
the cHW§s of open sets in X is determined by the requirement that 
the class of all sets of the form XK {E;: i eZ} be a subbase. If 
the function &; on X is defined by é;(~) = x(#), then &; is continu- 
ous. The Cartesian product of any class of compact spaces is 
compact. 

A topological space is a Hausdorff space if every pair of distinct 
points have disjoint neighborhoods. Two disjoint compact sets 
in a Hausdorff space have disjoint neighborhoods. A compact 
subset of a Hausdorff space is closed. If a locally compact space 
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is a Hausdorff space or a separable space, then so is its one-point 
compactification. A real valued continuous function on a compact 
set is bounded. 

For any topological space X we denote by § (or §(X)) the class 
of all real valued continuous functions f such that 0 S$ f(~) <1 
for all « in X. A Hausdorff space is completely regular if, for 
every point y in X and every closed set F not containing y, there 
is a function f in § such that f(y) = 0 and, for « in F, f(x) = 1. 
A locally compact Hausdorff space is completely regular. 

A metric space is a set X and a real valued function d (called 
distance) on X X_X, such that d(x,y) 2 0, d(«,y) = 0 if and only 
if x = y, d(x,y) = d(y,x), and d(x,y) S d(x,z) + d(z,y). If E and 
F are non empty subsets of a metric space X, the distance between 
them is defined to be the number d(E,F) = inf {d(x,y): x ¢ E, 
yeF}. If F = {xo} is a one-point set, we write d(E,xo) in place 
of d(E,{xo}). A sphere (with center x9 and radius ro) is a subset 
E of a metric space X such that, for some point *9 and some posi- 
tive number ro, E = {x: d(xo,x) < ro}. The topology of a metric 
space is determined by the requirement that the class of all 
spheres be a base. A metric space is completely regular. A closed 
set in a metric space is a Gs. A metric space is separable if and only 
if it contains a countable dense set. If E is a subset of a metric 
space and f(x) = d(E,«), then f is a continuous function and 
E = {x:f(x) = 0}. If X is the real line, or the Cartesian product 
of a finite number of real lines, then X is a locally compact separa- 
ble Hausdorff space; it is even a metric space if for x = (%1, +++, Xn) 
and y= (y1, --+, yn) the distance d(x,y) is defined to be 
(diie1 (%s — y:)?)%. A closed interval in the real line is a, com- 
pact set. "3 

A transformation T from a topological space X into a topological 
space Y is continuous if the inverse image of every open set is 
open, or, equivalently, if the inverse image of every closed set is 
closed. The transformation T is open if the image of every open 
set is open. If B is a subbase in Y, then a necessary and sufficient 
condition that 7 be continuous is that J—'(B) be open for every 
B in B. If a continuous transformation T maps X onto Y, and 
if X is compact, then Y is compact. A homeomorphism is a one 
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to one, continuous transformation of X onto Y whose inverse is 
also continuous. 

The sum of a uniformly convergent series of real valued, con- 
tinuous functions is continuous. If f and g are real valued con- 
tinuous functions, then f U g and f / g are continuous. 


Topological Groups 


A group is a non empty set X of elements for which an associa- 
tive multiplication is defined so that, for any two elements @ and 
4 of X, the equations ax = 4 and ya = bare solvable. In every 
group X there is a unique identity element ¢, characterized by 
the fact that ex = xe = x for every x in X. Each element « 
of X has a unique inverse, x~', characterized by the fact that 
xx! = «7'x =e. A non empty subset Y of X is a subgroup 
if x~'y e Y whenever x and y are in Y. If E is any subset of a 
group X, E~! is the set of all elements of the form x~', where 
x e E; if E and F are any two subsets of X, EF is the set of all 
elements of the form xy, where xe E and ye F. A non empty 
subset Y of X is a subgroup if and only if Y7Y CY. If«eX, 
it is customary to write xE and Ex in place of {x}E and E{x} 
respectively; the set xE [or Ex] is called a left translation [or right 
translation] of F. If Y is a subgroup of X, the sets xY and Yx 
are called (left and right) cosets of Y. A subgroup Y of X is 
invariant if xY = Yx for every x in X. If the product of two 
cosets Y, and Y2 of an invariant subgroup Y is defined to be 
Y,Y2, then, with respect to this notion of multiplication, the class 
of all cosets is a group X, called the quotient group of X modulo 
Y and denoted by X/Y. The identity element @ of X is Y. If 
Y is an invariant subgroup of X, and if for every x in X, r(x) 
is the coset of Y which contains x, then the transformation + 
is called the projection from X onto X. A homomorphism is a 
transformation T from a group X into a group Y such that 
T(xy) = T(x)T(y) for every two elements x and y of X. The 
projection from a group X onto a quotient group X is a homo- 
morphism. 

A topological group is a group X which is a Hausdorff space 
such that the transformation (from X X X onto X) which sends 
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(x,y) into x7!y is continuous. A class N of open sets containing 
e in a topological group is a base at ¢ if (a) for every x different 
from ¢ there exists a set U in N such that x e’ U, (b) for any two 
sets U and V inN there exists aset VW in N such thatW Cc UN V, 
(c) for any set U in N there exists a set Y in N such that 
VV CU, (d) for any set U in N and any element x in X, there 
exists a set V in N such that V Cc xUx™", and (e) for any set U 
in N and any element x in U there exists a set V in N such that 
Vx CU. The class of all neighborhoods of ¢ is a base at ¢; con- 
versely if, in any group X, N is a class of sets satisfying the condi- 
tions described above, and if the class of all translations of sets 
of N is taken for a base, then, with respect to the topology so 
defined, X becomes a topological group. A neighborhood V of e 
is symmetric if VY = V~'; the class of all symmetric neighbor- 
hoods of ¢ is a base at e. If N is a base at ¢ and if F is any closed 
set in X, then F = () {UF: UeN}. 

The closure of a subgroup [or of an invariant subgroup] of a 
topological group X is a subgroup [or an invariant subgroup] of 
X. If Y is a closed invariant subgroup of X, and if a subset of 
X = X/Y is called open if and only if its inverse image (under 
the projection 7) is open in X, then X is a topological group and 
the transformation x from X onto X is open and continuous. 

If C is a compact set and U is an open set in a topological group 
X, and if C c U, then there exists a neighborhood V of e such 
that VCV < U. If C and D are two disjoint compact sets, then 
there exists a neighborhood U of e such that UCU and UDU 
are disjoint. If C and D are any two compact sets, then C7! 
and CD are also compact. 

A subset E of a topological group X is bounded if, for every 
neighborhood U of e, there exists a finite set {x1, --+, xn} (which, 
in case E ~0, may be assumed to be a subset of E) such 
that E c Uf. «.U; if X is locally compact, then this definition 
of boundedness agrees with the one applicable in any locally com- 
pact space (i.e. the one which requires that the closure of E be 
compact). If a continuous, real valued function f on X is such 
that the set N(f) = {x:/(x) ~ 0} is bounded, then / is uniformly 
continuous in the sense that to every positive number ¢ there 


8 PREREQUISITES . [Ssc. 0] 


corresponds a neighborhood U of e¢ such that | f(*1) — f(x2) | < ¢ 
whenever x,%27! e U. 

A topological group is locally bounded if there exists in it a 
bounded neighborhood of e. To every locally bounded topo- 
logical group X, there corresponds a locally compact topological 
group X*, called the completion of X (uniquely determined to 
within an isomorphism), such that X is a dense subgroup of X*. 
Every closed subgroup and every quotient group of a locally 
compact group is a locally compact group. 


Chapter 1 


SETS AND CLASSES 


§ 1. seT INCLUSION 


Throughout this book, whenever the word set is used, it will 
be interpreted to mean a subset of a given set, which, unless it is 
assigned a different symbol in a special context, will be denoted 
by X. The elements of X will be called points; the set X will 
be referred to as the space, or the whole or entire space, under 
consideration. The purpose of this introductory chapter is to de- 
fine the basic concepts of the theory of sets, and to state the 
principal results which will be used constantly in what follows. 

If x is a point of X and E is a subset of X, the notation 


xek 


means that x belongs to E (i.e. that one of the points of E is x); 
the negation of this assertion, i.e. the statement that x does not 
belong to E, will be denoted by 


we E. 
Thus, for example, for every point x of X, we have 
xeX, 
and for no point x of X do we have 
xe’ X. 
If E and F are subsets of X, the notation 


EcF or FDE 
9 
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means that E is a subset of F, i.e. that every point of E belongs 
to F. In particular therefore 


EcE 


for every set E. Two sets E and F are called equal if and only 
if they contain exactly the same points, or, equivalently, if and 
only if 

ECF and FCE. 


This seemingly innocuous definition has as a consequence the 
important principle that the only way to prove that two sets are 
equal is to show, in two steps, that every point of either set be- 
longs also to the other. 

It makes for tremendous simplification in language and nota- 
tion to admit into the class of sets a set containing no points, which 
we shall call the empty set and denote by 0. For every set E 


we have 
OcEcX; 


for every point x we have 
xe’ 0. 


In addition to sets of points we shall have frequent occasion to 
consider also sets of sets. If, for instance, X is the real line, then 
an interval is a set, i.e. a subset of X, but the set of all intervals 
is a set of sets. To help keep the notions clear, we shall always 
use the word class for a set of sets. The same notations and 
terminology will be used for classes as for sets. Thus, for instance, 
if E is a set and E 1s a class of sets, then 


Eek 


means that the set E belongs to (is a member of, is an element of) 
the class E; if E and F are classes, then 


EcF 


means that every set of E belongs also to F, i.e. that E is a sub. 
class of F. 

On very rare occasions we shall also have to consider sets of 
classes, for which we shall always use the word collection. If, 
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for instance, X is the Euclidean plane and E, is the class of all 
intervals on the horizontal line at distance y from the origin, then 
each E, is a class and the set of all these classes is a collection. 


(1) The relation C between sets is always reflexive and transitive; it is sym- 
metric if and only if X is empty. 

(2) Let X be the class of all subsets of X, including of course the empty set 0 
and the whole space X; let x be a point of X, let E be a subset of X (i.e. a member 
of X), and let E be a class of subsets of X (i.e. a subclass of X). If 4 and v vary 
independently over the five symbols x, E, X, E, X, then some of the fifty rela- 
tions of the forms 

uev or uCo 


are necessarily true, some are possibly true, some are necessarily false, and some 
are meaningless. In particular «€0 is meaningless unless the right term is a 
subset of a space of which the left term is a point, and u C o is meaningless 
unless « and v are both subsets of the same space. 


§2. UNIONS AND INTERSECTIONS 


If E is any class of subsets of X, the set of all those points of 
X which belong to at least one set of the class E is called the 
union of the sets of E; it will be denoted by 


UE or U{E:EcE}. 


The last written symbol is an application of an important and 
frequently used principle of notation. If we are given any set of 
objects denoted by the generic symbol x, and if, for each x, 1(*) 
is a proposition concerning x, then the symbol 

{x: a(x)} 
denotes the set of those points x for which the proposition (x) 
is true. If {2,(x)} is a sequence of propositions concerning x, 
the symbol 
{x2 mi(x), a(x), --+} 

denotes the set of those points x for which 7,(x) is true for every 
n=1,2,--+. If, more generally, to every element + of a certain 
index set I’ there corresponds a proposition 7,(*) concerning x, 
then we shall denote the set of all those points x for which the 
proposition 7,(x) is true for every y in T by 


{x: T(x), ve T}. 
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Thus, for instance, 

{x:xeE}=E 
and 

{E: EcE} = E. 


For more illuminating examples we consider the sets 
{:0<¢<5 1} 
(= the closed unit interval), 
{ (x,y): 0? + y? = 1} 
(= the circumference of the unit circle in the plane), and 
{n?:n = 1,2, ---} 


(= the set of those positive integers which are squares). In ac- 
cordance with this notation, the upper and lower bounds (supre- 
mum and infimum) of a set E of real numbers are denoted by 


sup {x:xeE} and inf {x: xe E} 


respectively. 

In general the brace {---} notation will be reserved for the 
formation of sets. Thus, for instance, if x and y are points, then 
{x,y} denotes the set whose only elements are x and y. It is 
important logically to distinguish between the point « and the 
set {x} whcse only element is x, and similarly to distinguish 
between the set E and the class {E} whose only element is E. 
The empty set 0, for example, contains no points, but the class 
{0} contains exactly one set, namely the empty set. 

For the union of special classes of sets various special notations 
are used. If, for instance, 


E= {Z,, E,} 
then 
UE = U {Ei = 1,2} 
is denoted by 
Ey U E23 


if, more generally, 
E={£, +5 En} 
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is a finite class of sets, then 


UE=U{Esi=1,---,n} 
E,U-:- UE, or Ut, E 


If, similarly, {Z,} is an infinite sequence of sets, then the union 
of the terms of this sequence is denoted by 


Fk, UE,U--- or URE 


More generally, if to every element y of a certain index set T 
there corresponds a set £,, then the union of the class of sets 


{Ey:y eT} 


U,er£, or U,E,. 
If the index set T is empty, we shall make the convention that 
U, £, = 0. 


The relations of the empty set 0 and the whole space X to 
the formation of unions are given by the identities 


EVUO=E and EUX=X. 


is denoted by 


is denoted by 


More generally it is true that 


1 Soe os 
if and only if 
EUF=PF. 


If E is any class of subsets of X, the set of all those points of 
X which belong to every set of the class E is called the intersection 
of the sets of E; it will be denoted by 

ME or (\{E: Eck}. 
Symbols similar to those used for unions are used, but with the 
symbol U replaced by f, for the intersections of two sets, of a 
finite or countably infinite sequence of sets, or of the terms of 


any indexed class of sets. If the index set I is empty, we shall 
make the somewhat startling convention that 


Nyer Ey = X. 
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There are several heuristic motivations for this convention, One 
of them is that if T', and I’, are two (non empty) index sets for 
which T, CTs, then clearly 


Nrer: Ey 2 Nyer: E,, 


and that therefore to the smallest possible I, ie. the empty one, 
we should make correspond the largest possible intersection. 
Another motivation is the identity 


Ne erur: Ey td Ni, er, Ey nN Ny, er, Ey, 


valid for all non empty index sets, andT2. If we insist that this 
identity remain valid for arbitrary T, and Tg, then we are com- 
mitted to believing that, for every I, 


Nr Ey = N,eruo Ey = Nrer Ey n N10 Ey; 


writing E, = X for every y in I’, we conclude that 
Nn, £0 Ey = X, 


Union and intersection are sometimes called join and meet, 
respectively. As a mnemonic device for distinguishing between 
U and NM (which, by the way, are usually read as cup and cap, 
respectively), it may be remarked that the symbol U is similar 
to the initial letter of the word ‘‘union” and the symbol N is 
similar to the initial letter of the word “meet.” 

The relations of 0 and X to the formation of intersections are 
given by the identities 


EN0O=0 and ENX=E. 
More generally it is true that 


EcF 
it and only if 
ENF=E., 


Two sets E and F are called disjoint if they have no points in 
common, i.e. if 


EnF=0. 
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A disjoint class is a class E of sets such that every two distinct 
sets of E are disjoint; in this case we shall refer to the union of the 
sets of E as a disjoint union. 

We conclude this section with the introduction of the useful 
concept of characteristic function. If E is any subset of X, the 
function xz, defined for all x in X by the relations 


(x) ( if «cE, 

x) = : 

oe 0 if xe’ EF, 

is called the characteristic function of the set E. The correspond- 
ence between sets and their characteristic functions is one to one, 
and all properties of sets and set operations may be expressed 
by means of characteristic functions. As one more relevant illus- 
tration of the brace notation, we mention 


E = {x: xz(x) = 1}. 


(1) The formation of unions is commutative and associative, i.e. 
EUF=FUE and EU(FUG)=(EUPUG; 


the same is true for the formation of intersections. 
(2) Each of the two operations, the formation of unions and the formation 
of intersections, is distributive with respect to the other, i.e. 


EN (FUG) =(ENAU(ENG 
EU(FNG=(EVUP/N (EUG). 


More generally the following extended distributive laws are valid: 
FON U{E: EeE} = U {EN F: EeE} 


FUP) {E: EcE} =() {EU F: Eek}. 


(3) Does the class of all subsets of X form a group with respect to either of 
the operations U and M? 
(4) xo(x) = 0, xx(~) = 1. The relation 
xa(x) S xr(*) 


is valid for allx in X if and only ifECF. FEN F=Aand EUF=B, 
then 


and 


and 


XA = Xexr =xXe MN xr and xs =xXe+XxXr —X4 = XB U xe. 


(5) Do the identities in (4), expressing xa and xg in terms of xg and xr, 
have generalizations to finite, countably infinite, and arbitrary unions and 
intersections? 
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§3. LIMITS, COMPLEMENTS, AND DIFFERENCES 


If {E,} is a sequence of subsets of X, the set E* of all those 
points of X which belong to &,, for infinitely many values of n 
is called the superior limit of the sequence and is denoted by 


E* = lim sup, Ey. 


The set Ex of all those points of X which belong to E,, for all but 
a finite number of values of 7 is called the inferior limit of the 
sequence and is denoted by 


E, = lim inf, E,. 


If it so happens that 
Ex = E*. 


we shall use the notation 
lim, E, 
for this set. If the sequence is such that 
E, C Ens, for n= 1,2,---, 
it is called increasing; if 
E,> Engi, for n= 1,2,---, 


it is called decreasing. Both increasing and decreasing sequences 
will be referred to as monotone. It is easy to verify that if {E,} 
is a monotone sequence, then lim, £, exists and is equal to 


U. E, or an Fa 
according as the sequence is increasing or decreasing. 

The complement of a subset E of X is the set of all those 
points of X which do not belong to E; it will be denoted by £’. 
The operation of forming complements satisfies the following 
algebraic identities: 

ENE =0, EU EF =X, 
0 =X, (£)’=E, X'=0, and 
if ECF, then EF’ DF’ 


(Src. 3] SETS AND CLASSES 17 


The formation of complements also bears an interesting and very 
important relation to unions and intersections, expressed by the 
identities 


(U {E: Ee E})' = ff) {E': Ee E}, 
(() {E: E eE})' = U {F': E cE}. 


In words: the complement of the union of a class of sets is the 
intersection of their complements, and the complement of their 
intersection is the union of their complements. This fact, together 
with the elementary formulas relating to complements, proves the 
important principle of duality: 
any valid identity among sets, obtained by forming unions, 
intersections, and complements, remains valid if in it the 
symbols 
ft), c, and 0 
are interchanged with 
U, >, and X 


respectively (and equality and complementation are left 
unchanged). 


If E and F are subsets of X, the difference between E and F, 
in symbols 
E-F, 


is the set of all those points of E which do not belong to F. Since 
X-F=F, 
and, more generally, 
E-F=ENF, 
the difference E — F is frequently called the relative complement 
of Fin E. The operation of forming differences, similarly to the 


operation of forming complements, interchanges (J with {) and 
c with >, so that, for instance, 


E-(FUG) =(E-P)n(E-©@). 
The difference E ~ F is called proper if E > F. 
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As the final and frequently very important operation on sets 
we introduce the symmetric difference of two sets E and F, 


denoted by 
EAP, 


and defined by 
EAF=(E-FP)U(F-E)=(ENF)UCE NF). 


The formation of limits, complements, and differences of sets 
requires a bit of practice for ease in manipulation. The reader 
is accordingly advised to carry through the proofs of the most 
important properties of these processes, listed in the exercises 
that follow. 


(1) Another heuristic motivation of the convention 
nN, eoky =X 
is the desire to have the identity 
N1er Ey = (Uyer Ey’, 


which is valid for all non empty index sets I’, remain valid for! = 0. 
(2) If E, = lim inf, E, and E* = lim sup, En, then 


Ex = Use Nin-a Em C A e-1 Unaen Em = E*. 


(3) The superior limit, inferior limit, and limit (if it exists) of a sequence of 
sets are unaltered if a finite number of the terms of the sequence are changed. 
(4) If E, = 4 or B according as 7 is even or odd, then 


lim inf, Ex = 4M B and limsup, E, = 4 U B. 
(5) If {E,} is a disjoint sequence, then 
lim, E, = 0. 
(6) If Ey = lim inf, En and E* = lim sup, Ep, then 
(Ey)! = lim sup, En’ and (£*)’ = lim inf, E,’. 
More generally, 
F — Ey = limsup, (F — Ey) and F — E* = lim inf, (F — E,). 
(7) E-F=E-(ENP=(EUP -F, 
EN (F—-G) =(ENP-(ENG), (EUP-G=(E-GU(F-G. 
(8) (E-G)N (F-G) =(ENF)-G, 
(E-F)-G=E-(FUG, E-(F-G=(E-FHU(ENG, 
(E-F)N(G- A) =(ENG —-(FUA). 
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(9) EAF=FAE, EA(FAG) = (EAF)AG, 
EN (FAG) =(ENFA(ENG, 
EAO=E, EAX=E, 
EAE=0, EAE'=X, 
EAF=(EUF)~(EN PF. 


(10) Does the class of all subsets of X form a group with respect to the opera- 
tion A? 
(11) If Ey = lim inf, E, and E* = lim supa Eq, then 
Xz,(x) = lim inf, xz,(“) and x20(x) = lim supa xz,(*). 


(The expressions on the right sides of these equations refer, of course, to the 
usual numerical concepts of superior limit and inferior limit.) 
(12) xe = 1 — xz, xe-r = Xe(1 — xe), 


Xear = |xe —xel = xe + xe (mad 2). 
(13) If {E,} is a sequence of sets, write 
D,=E;, D2 =D, AE, D3 = Dr AEs, 
and, in general, 
Dati = Da A Enyi for m=1,2,---. 
The limit of the sequence {D,} exists if and only if lim, E, = 0. If the opera- 
tion A is thought of as addition (cf. (12)), then this result has the following 


verbal phrasing: an infinite series of sets converges if and only if its terms ap- 
proach zero. 


§4. RINGS AND ALGEBRAS 


A ring (or Boolean ring) of sets is a non empty class R of sets 


such that if 
EeR and FeR, 


then 
EUFeR and E—FeR. 


In other words a ring is a non empty class of sets which is closed 
under the formation of unions and differences. 
The empty set belongs to every ring R, for if 


EeR, 


then 
O=E—EcR. 
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Since 
E-F=(EUF)-F, 


it follows that a non empty class of sets closed under the formation 
of unions and proper differences is a ring. Since 


EAF=(E-—F)U (F-E) 
and 


ENF=(EUF) -(EAP, 


it follows that a ring is closed under the formation of symmetric 
differences and intersections. An application of mathematical 
induction and the associative laws for unions and intersections 
shows that if R is a ring and 


E;eR, t=1,---, 4”, 
then 


U?r-i#:eR and fh, E:eR. 


Two extreme but useful examples of rings are the class {0} 
containing the empty set only, and the class of all subsets of X. 
Another example, for an arbitrary set X, is the class of all finite 
sets. A more illuminating example is the following. Let 


X = {x1 -0 <x < +o} 


be the real line, and let R be the class of all finite unions of 
bounded, left closed, and right open intervals, i.e. the class of 
all sets of the form 


Ulu {[x: -0 <a; 5% <4; < +o}. 


Union and intersection are treated unsymmetrically in the 
definition of rings. While, for instance, it is true that a ring is 
closed under the formation of intersections, it is not true that a 
class of sets closed under the formation of intersections and dif- 
ferences is necessarily closed also under the formation of unions. 
If, however, a non empty class of sets is closed under the formation 
of intersections, proper differences, and disjoint unions, then it 
is aring. (Proof: 


EUF=(E-(EN F)) U(F-(EN FU (EN P).) 
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It is easily possible to give a definition of rings which is more 
nearly symmetric in its treatment of union and intersection: a 
ring may be defined as a non empty class of sets closed under the 
formation of intersections and symmetric differences. The proof 
of this statement is in the identities: 


EUF=(EAF)A(ENF) E-F=EA(EN F). 


If in this form of the definition we replace intersection by union 
we obtain a true statement: a non empty class of sets closed under 
the formation of unions and symmetric differences is a ring. 

An algebra (or Boolean algebra) of sets is a non empty class 
R of sets such that 


(a) ifEeRandF eR, then E U F eR, and 
(b) if Ee R, then FE’ eR. 


Since 
E-F=ENFP=(£ UF), 


it follows that every algebra is a ring. The relation between the 
general concept of ring and the more special concept of algebra is 
simple: an algebra may be characterized as a ring containing X. 


Since 
E’'=xX-E, 


it is clear that every such ring is an algebra; if, conversely, R 
is an algebra and 


EeR 
(we recall that R is non empty), then 
X=EUE’'eR. 


(1) The following classes of sets are examples of rings and algebras. 
(1a) X is n-dimensional Euclidean space; E is the class of all finite unions of 
semiclosed “‘intervals” of the form 


{(x1, 009, tn) —O <a; Sei <b <0, F= 1, +++, ny}. 


(1b) X is an uncountable set; E is the class of all countable subsets of X. 

(1c) X is an uncountable set; E is the class of all sets which either are count- 
able or have countable complements. 

(2) Which topological spaces have the property that the class E of open sets 
is a ring? 
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(3) The intersection of any collection of rings or algebras is again a ring or an 
algebra, respectively. 
(4) If R is a ring of sets and if we define, for E and F inR, 


EOF=ENF and EOF=EAF, 


then, with respect to the operations of “‘addition” (®) and “multiplication” 
(©), the system R is a ring in the algebraic sense of the word. Algebraic rings, 
such as this one, in which every element is idempotent (i.e. E © E = E for 
every E£ in R) are also called Boolean rings. The existence of a very close rela- 
tion between Boolean rings of sets and Boolean rings in general is the main 
justification of the ring terminology in the set theoretic case. 

(5) If R is a ring of sets and if A is the class of all those sets E for which 


either EeR orelse E’ eR, 
then A is an algebra. 
(6) A semiring is a non empty class P of sets such that 


(6a) if Ee Pand FeP, then Ef FeP, and 

(6b) if Ee Pand Fe Pand EC F, then there is a finite class {Co, Ci, ---, Ca} 
of sets in P such that E= Cy) CC, C---CC, = F and D; = C; — 
Cy..eP fori = 1, ---, 2. 


The empty set belongs to every semiring. If X is any set, then the class P 
consisting of the empty set and all one-point sets (i.e. sets of the form {x} with 
xe X)isasemiring. If X is the real line, the class of all bounded, left closed, and 
right open intervals is a semiring. 


§ 5. GENERATED RINGS AND o—RINGS 


Theorem A. Jf E is any class of sets, then there exists a 
unique ring Ro such that Ryo DE and such that if R is any 
other ring containing E then Ro CR. 


The ring Ro, the smallest ring containing E, is called the ring 
generated by E; it will be denoted by R(E). 

Proof. Since the class of all subsets of X is a ring, it is clear 
that at least one ring containing E always exists. Since more- 
over (cf. 4.3) the intersection of any collection of rings is also a 
ring, the intersection of all rings containing E is easily seen to be 
the desired ring Ro. J 


Theorem B. Jf E is any class of sets, then every set in R(E) 
may be covered by a finite union of sets in E. 
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Proof. The class of all sets which may be covered by a finite 
union of sets in E is a ring; since this ring contains E, it also con- 
tains R(E). § 


Theorem C. Jf Eis a countable class of sets, then R(E) is 
countable, 


Proof. For any class C of sets, we write C* for the class of all 
finite unions of differences of sets of C. It is clear that if C is 
countable, then so is C*, and if 


0eC, 
then 
Cc C*. 


To prove the theorem we assume, as we may without any loss 
of generality, that 

OcE, 
and we write 


E, = E, E, = Rs n=1,2,---. 


It is clear that 
Ec U;..E, c R(®), 


and that the class 
Uno En 


is countable; we shall complete the proof by showing that U*7_> En 
is aring. Since 


E=E,cE, CE. c:-., 


it follows that if 4 and B are any two sets in 2.0 E,, then there 
exists a positive integer 7 such that both 4 and B belong to E,. 
We have 
Aa = B © En41) 
and, since 
Oe Ep Cc E,, 
it follows also that 


AU B= (4 —0) U (B—0) eEnut. 
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We have proved therefore that both 4 — B and 4 U B belong 
to Ur.o En, ie. that Ut.oE, is indeed closed under the forma- 
tion of unions and differences. Jf 


Ao-ring is a non empty class S of sets such that 


(a) if EeSandFc§S, then E — Fc§, and 
(b) if E,eS,i = 1,2, ++, then U2, EeS. 


Equivalently a o-ring is a ring closed under the formation of 
countable unions. If S is a o-ring and if 


E;,eS, i=1,2,---, and E= UE, 
then the identity 


fel E; =E-— Ui: (E = E;) 
shows that 
Nin E; eS, 


i.e. that a o-ring is closed under the formation of countable inter- 
sections. It follows also (cf. 3.2) that if S is a o-ring and 


E; e§, i= 1,2, ees 


then both lim inf; Z; and lim sup; E; belong to S. 

Since the truth and proof of Theorem A remain unaltered if 
we replace “ring” by “o-ring” throughout, we may define the 
o-ring S(E) generated by any class E of sets as the smallest 
o-ring containing E. 


Theorem D. /fEis any class of sets and E is any setinS = 
S(E), then there exists a countable subclass D of E such that 
EeS(D). 


Proof. The union of all those o-subrings of S which are 
generated by some countable subclass of E is a o-ring containing 
E and contained in §S; it is therefore identical with S. J 

For every class E of subsets of X and every fixed subset 4 
of X, we shall denote by 

ENA 


the class of all sets of the form EN 4 with E in E. 


(Sec. 5] SETS AND CLASSES 25 
Theorem E. Jf E is any class of sets and if A is any subset 
of X, then 
SE) N 4=S(EN J. 
Proof. Denote by C the class of all sets of the form B U 
(C — A), where 
BeSENA) and CeS); 


it is easy to verify that C is a o-ring. If E cE, then the relation 


E=(EN 4) U (E- 4), 
together with 


ENAcENACSEN YD, 
shows that E eC, and therefore that 


EcC. 
It follows that 
SE) <C 
and therefore that 


SE) NAcCNA. 
Since, however, it is obvious that 


CNda=SENYD, 
it follows that 


S®@) nN 4cSEN 4%. 
The reverse inequality, 
SEN 4) cS) N 4, 
follows from the facts that S(E) N 4 is a o-ring and 
ENAcSEH) NZ. J 


(1) For each of the following examples, what is the ring generated by the 
class E of sets there described? 

(1a) For a fixed subset E of X, E = {E} is the class containing E only. 

(1b) For a fixed subset E of X, E is the class of all sets cf which E is a subset, 
ie E = {F: EC Fh. 

(1c) E is the class of all sets which contain exactly two points. 

(2) A lattice (of sets) is a class L such that Oe L and such that if Ee L and 
Fel, then EU FeL and EN Fel. Let P = P(L) be the class of all sets 
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of the form F — E, where EeL, Fel, and EC F; then P is a semiring; 
(cf. 4.6). (Hint: if 
D;=F,- E, 1=1,2 


are representations of two sets of P as proper differences of sets of L, and if 
D; D Ds, then for 
C= (Fi N Fx) — (Ai N Fa), 


or, alternatively, for 


Cc =, Fi = [Zi U (Fi nN E,)\, 


we have Fp - FE, CCC F,— £j.) Is Paring? 

(3) Let P be a semiring and let R be the class of all sets of the form U?., E,, 
where { Fi, ---, En} is an arbitrary finite, disjoint class of sets in P. 

(3a) R is closed under the formation of finite intersections and disjoint 
unions. 

(3b) If Ee P, Fe P, and ECF, then F— EcR. 

(3c) If Ee P, Fe R, and EC F, then F— EeR. 

(3d) If Ee R, Fe R, and ECF, then F—EeR. 

(3e) R = R(P). It follows in particular that a semiring which is closed under 
the formation of unions is a ring. 

(4) The fact that the analog of Theorem A for algebras is true may be seen 
either by replacing “ring” by “algebra” in its proof or by using 4.5. 

(5) If P is a semiring and R = R(P), then S(R) = S(P). 

(6) Is it true that if a non empty class of sets is closed under the formation of 
symmetric differences and countable intersections, then it is a o-ring? 

(7) IfE is a non empty class of sets, then every set in S(E) may be covered by 
a countable union of sets in E; (cf. Theorem B). 

(8) If E is an infinite class of sets, then E and R(E) have the same cardinal 
number; (cf. Theorem C). 

(9) The following procedure yields an analog of Theorem C for o-rings; 
(cf. also (8)). If E is any class of sets containing 0, write Eo = E, and, for any 
ordinal a > 0, write, inductively, 


E. = (U (Es: 8B < a})*, 


where C* denotes the class of all countable unions of differences of sets of C. 
(9a) If0 <a <f, thn ECE, CE; CS(&). 
(9b) If Q is the first uncountable ordinal, then S(E) = U{E.: a < Qh. 
(9c) If the cardinal number of E is not greater than that of the continuum, 
then the same is true of the cardinal number of S(E). 
(10) What are the analogs of Theorems D and E for rings instead of o-rings? 


§ 6. MONOTONE CLASSES 


It is impossible to give a constructive process for obtaining the 
o-ring generated by a class of sets. By studying, however, 
another type of class, less restricted than a o-ring, it is possible 
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to obtain a technically very helpful theorem concerning the 
structure of generated o-rings. 

A non empty class M of sets is monotone if, for every monotone 
sequence {£,} of sets in M, we have 


lim, E, eM. 


Since it is true for monotone classes (just as for rings and 
o-rings) that the class of all subsets of X is a monotone class, 
and that the intersection of any collection of monotone classes 
is a monotone class, we may define the monotone class M(E) 
generated by any class E of sets as the smallest monotone class 
containing E. 


Theorem A. 4 o-ring is a monotone class; a monotone ring 
1s @ o-ring. 
Proof. The first assertion is obvious. To prove the second 
assertion we must show that a monotone ring is closed under 
the formation of countable unions. If M is a monotone ring 


and if 
E; eM, i= 1, 2, S888 5 


then, since M is a ring, 
Ui-.EieM, 2 =1,2,---. 


Since {(U?., Z,} is an increasing sequence of sets whose union is 
Uj... Z,, the fact that M is a monotone class implies that 


Ur. E; eM. | 


Theorem B. Jf R is a ring, then M(R) = S(R). Hence 
if a monotone class contains a ring R, then it contains S(R). 


Proof. Since a o-ring is a monotone class and since S(R) D 
R, it follows that 
S(R) > M = M(R). 


The proof will be completed by showing that M is a o-ring; it 
will then follow, since M(R) > R, that M(R) > S(R). 

For any set F let K(F) be the class of all those sets E for which 
E-F, F-—E, and EUF are all in M. We observe that, 
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because of the symmetric roles of E and F in the definition of 
K(F), the relations 


EeK(F) and FeK(E) 


are equivalent. If {£,} is a monotone sequence of sets in K(F), 


then 
lim, E, — F = lim, (E, — F) eM, 


F — lim, EZ, = lim, (F — E,) eM, 
F U lim, Z, = lim, (F U £,) eM, 


so that if K(F) is not empty, then it is a monotone class. 

If E eR and F eR, then, by the definition of a ring, EF e K(F). 
Since this is true for every E in R, it follows that R Cc K(F), 
and therefore, since M is the smallest monotone class containing 
R, that 

M c K(F). 


Hence if E e Mand F eR, then E ce K(P), and therefore F ¢ K(E£). 
Since this is true for every F in R, it follows as before that 


M c K(£). 


The validity of this relation for every E in M is equivalent to the 
assertion that M is a ring; the desired conclusion follows from 
Theorem A. § 

This theorem does not tell us, given a ring R, how to construct 
the generated o-ring. It does tell us that, instead of studying 
the o-ring generated by R, it is sufficient to study the monotone 
class generated by R. In many applications that is quite easy 
to do. 


(1) Is Theorem B true for semirings instead of rings? 

(2) A class N of sets is normal if it is closed under the formation of countable 
decreasing intersections and countable disjoint unions. A o-ring is a normal 
class; a normal ring is a o-ring. 

(3) If the smallest normal class containing a class E is denoted by N(B), 
then, for every semiring P, N(P) = S(P). 

(4) If ao-algebra of sets is defined as a non empty class of sets closed under 
the formation of complements and countable unions, then a o-algebra is a 
o-ring containing X. If R is an algebra, then M(R) coincides with the smallest 
o-algebra containing R. Is this result true if R is a ring? 
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(5) For each of the following examples what is the o-algebra, the o-ring, 
and the monotone class generated by the class E of sets there described? 

(Sa) Let X be any set and let P be any permutation of the points of X, i.e. 
P is a one to one transformation of X onto itself. A subset E of X is invariant 
under P if, whenever x e F, then P(x) e E and P-!(x)e E. Let E be the class 
of all invariant sets. 

(5b) Let X and Y be any two sets and let T be any (not necessarily one to 
one) transformation defined on X and taking X into Y. For every subset E 
of Y denote by 7~!(Z) the set of all points x in X for which T(x) ce E. Let E 
be the class of all sets of the form T~!(Z), where E varies over all subsets of Y. 

(5c) X is a topological space; E is the class of all sets of the first category. 

(5d) X is three dimensional Euclidean space. Let a subset E of X be called 
a cylinder if whenever (x,y,z) e E, then (x,y,£)eE for every real number 4. 
Let E be the class of all cylinders. 

(Se) X is the Euclidean plane; E is the class of all sets which may be covered 
by countably many horizontal lines. 


Chapter II 


MEASURES AND OUTER MEASURES 


§ 7. MEASURE ON RINGS 


A set function is a function whose domain of definition is a 
class of sets. An extended real valued set function p» defined on a 
class E of sets is additive if, whenever 


EeE, FeE, EUFeE, and ENF=0, 
then 
BE U F) = p(E) + w(F). 
An extended real valued set function » defined on a class E is 
finitely additive if, for every finite, disjoint class {F,, ---, En} 
of sets in E whose union is also in E, we have 


u(Uin Es) = Dots w(Z). 


An extended real valued set function » defined on a class E is 
countably additive if, for every disjoint sequence {E,} of sets in 
E whose union is also in E, we have 


mc 6 pen E,) = eee u(E,). 


A measure is an extended real valued, non negative, and countably 
additive set function y, defined on a ring R, and such that »(0) = 0. 
We observe that, in view of the identity, 


Ut. E; = £, U---UE, UOU0U--, 


a measure is always finitely additive. A rather trivial example 
of a measure may be obtained as follows. Let / be an extended 
30 
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real valued, non negative function of the points of a set X. Let 
the ring R consist of all finite subsets of X; define » by 


wai, <<; ta}) = Dhar fle) and pO) = 0. 


Less trivial examples will be presented in the following sections. 

If « is a measure on a ring R, a set E in R is said to have finite 
measure if u(Z) < «; the measure of E is o-finite if there exists 
a sequence {£,} of sets in R such that 


Ec Ur: E, and u(E,,) <2, n= 1, 2, eye 


If the measure of every set E im R is finite [or o—finite], the measure 
pis Called finite [oro—finite] on R. If X eR (i.e. if R is an algebra) 
and u(X) is finite or o-finite, then p is called totally finite or 
totally o~finite respectively. The measure p is called complete 
if the conditions 


EeR, FCE, and p»(£) =0 
imply that F eR. 


(1) If « is an extended real valued, non negative, and additive set function 
defined on a ring R and such that u(Z) < © for at least one £ inR, then (0) = 0. 

(2) If E is a non empty class of sets and yw is a measure on R(E) such that 
pw(E) < © whenever Ec E, then p is finite on R(B); cf. 5.B. 

(3) If » is a measure on a o-ring, then the class of all sets of finite measure 
is a ring and the class of all sets of c-finite measure is a o-ring. If, in addition, 
p# is o-finite, then a necessary and sufficient condition that the class of all sets 
of finite measure be a o-ring is that y be finite. Is the latter statement true if 
p» is not o-finite? 

(4) Suppose that » is a measure on a o-ring S and that E is a set in S of 
o-finite measure. If D is a disjoint class of sets in S, then n(E N D) € 0 for 
at most countably many sets D in D. (Hint: assume first that u(E) < ©; for 
each positive integer 7 consider the class 


{D: DeD, w(END) 2/1.) 


(5) If « is an extended real valued, non negative, and additive set function 
defined on a ring R and such that #(0) = 0, then p is finitely additive. The proof 
of the same statement for a semiring P is not trivial; it may be achieved by the 
following considerations. A finite, disjoint class {#i1, ---, En} of sets in P 
whose union, E£, is also in P is called a partition of E. The partition { £,} is called 
a p-partition if, for every F in P, 


BEN F) = Dt w(E N F). 
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If {Z;} and {F;} are partitions of E, then {£;} is called a subpartition of {F;} 
if each set E; is contained in one of the sets Fj. 

(5a) If {E,} and {F;} are partitions of E, then so is their product, consisting 
of all sets of the form E; N F;. 

(5b) If a subpartition of a partition {Z;,} is a u-partition, then {E,} is a 
u—-partition. 

(5c) The product of two p-partitions is a y—partition. 

(5d) IfFE=CQyo CC, C---CC, = F, where C;e P, § = 0, 1, ---, n, and if 


D; =C;- Cy-1e P, i='l, seem, 


then {£, Di, ---, Da} is a u-partition of F. 
(5e) Every partition of a set E in P is a y-partition. 


§ 8. MEASURE ON INTERVALS 


In order to motivate and illustrate the elementary notions of 
measure theory, we now propose to discuss an important and 
classical special case. Throughout this section the underlying 
space X is to be the real line. We shall denote by P the class of 
all bounded, left closed, and right open intervals, i.e. the class of 
all sets of the form 


{x: -ao << aSx<b<o}; 


we shall denote by R the class of all finite, disjoint unions of sets 
of P, i.e. the class of all sets of the form 


Ulei {x1 -2 <a; Sx <4; < o}. 


(It is easy to verify that any union of this form may be written 
as a disjoint union of the same form.) 

For simplicity of language we shall always use the expression 
“‘semiclosed interval’ instead of “bounded, left closed, and right 
open interval.’”’ The consideration of semiclosed intervals, instead 
of open intervals or closed intervals, is a technical device. If, for 
instance, a, b, c, and d are real numbers, -~7 <a <b<c<d 
< ~, then the difference between the open intervals 


{x:a<x<d} and {x:b<x<c} 


is neither an open interval nor a finite union of open intervals, 
and the same negative statement holds for the corresponding 
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closed intervals. The fact that semiclosed intervals are better 
behaved in this respect is what makes them desirable. 
We shall, as usual, write [2,4] for a closed interval, 


[2,2] = {x:2S x 5}, 
[2,5) for a semiclosed interval, 
[2,b) = {x:a Sx <d}, 
and (4,4) for an open interval, 
(a,b) = {x:a<x <b}. 
In writing any of these symbols it shall always be understood that 
° ae class P of semiclosed intervals we define a set function 


mu by 
p([a,)) = b — a. 


We observe that when a = 4, the interval reduces to the empty 
set, so that 
20) = 0. 


We proceed to investigate the relation of the set function yu to 
some set theoretic notions in P, 


Theorem A. If {E,, ---, En} is a finite, disjoint class of 
sets in P, each contained in a given set Ey in P, then 


Di-1 u(Ei) S (Eo). 


Proof. We write E; = [a,,4;), i = 0, 1, ---, m, and, without 
any loss of generality, we assume that 
4 3°°-S apy. 
It follows from the assumed properties of {E,, ---, E,} that 
aSa8548::-S4a,5h,8 b, 
and therefore 
Di-1 w(E;) = Din @ — a) S 
S01 G - a) + DMT Ga - 4) = 
= by — a 3 bo — ao = w(Ed). J 


34 MEASURES AND OUTER MEASURES [Sec. 8} 


Theorem B. Jf a closed interval Fy, Fy = |ao,bo), is con- 
tained in the union of a finite number of bounded, open inter- 
vals, U,, +--+, Un, Us = (@i,b,), i = 1, +++, 0, then 


by — ao < Dilar (4: — @;). 


Proof. Let k; be such that a) e¢U,,. If d:, S bo, then let ke 
be such that 4, © U;,3 if dz, S 40, then let kz be such that 4,, © U4, 
and so on by induction. The process stops with km if d,, > do. 
There is no loss of generality in assuming that m = nand U,, = U; 
for i = 1, +++, m, because this state of affairs may be achieved 
merely by omitting superfluous U,’s and changing the notation. 


In other words we may (and do) assume that 
a <a9<h, an < do <b, 
and, in case x > 1, 
@in1 <b; < bigs for i= 1,---,n -1; 
it follows that 
by — a < bn — & = (4) — %) + Disisn-1 Gin — 4) S 
S D716; - 4). | 


Theorem C. Jf {Eo, E,, Eo, ---} is a sequence of sets in 
P, such that 
Eo < U?-1 E, 
then 
m(Eo) S Dives w(EX). 


Proof. We write E; = [2;,0;), i= 0, 1, 2, oe, Tf a), = bo, 
the theorem is trivial; otherwise let « be a positive number such 
that « < by — ao. If we write, for any positive number 6, 


é 
Fo = [a0, 40 — «| and U;= (354), eee Oe ee 


then 
iyo c Ui: U; 


and therefore, by the Heine—Borel theorem, there is a positive 
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integer n such that Fy C U., U;. From Theorem B we obtain 
6 
H(Eo) — € = (bo — ao) —€ < Ee (4 - a+ =) Ss 
S Die w(E,) + 6. 


Since e and 6 are arbitrary, the conclusion of the theorem fol- 
lows. ff 


Theorem D. The set function p is countably additive on P. 


Proof. If {£;} is a disjoint sequence of sets in P whose union, 
E, is also in P, then from Theorem A we have 


Die. (ES u(E) for n=1,2,---. 


It follows that 
Dini H(E,) S u(E); 


an application of Theorem C completes the proof. Jf 


Theorem E. There exists a unique, finite measure ji on the 
ring R such that, whenever E eP, i(E) = p(E). 


Proof. We know that every set E in R may be represented as 
a finite, disjoint union of sets in P. Suppose that 


E= LG heer E; and E= Uni F; 


are two such representations of the same set E. Then, for each 
t#=1,---,n, 
E,= Us-1 (E; N Fj) 


is a representation of the set EZ; in P as a finite, disjoint union of 
sets in P, and therefore, since uy is finitely additive, 


pe u(E;) = m1 jel u(E; nN F;). 
Similarly, of course, we have 
Dae M(F;) = Difer Dia w(E; N F)). 


It follows that, for every E in R, the function a is unambiguously 
defined by the equation 


iE) = Diet u(E; ’ 
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where {E,, ---, En} is a finite, disjoint class of sets in P whose 
union is E. 

It is clear from its definition that the function @ thus defined 
coincides with u on P and is finitely additive. Since any function 
satisfying these conditions must in particular be finitely additive 
when the terms of the union to be formed are in P, it is also clear 
that @ is unique. The only non trivial thing left to prove is that 
fi is countably additive. 

Let {E;} be a disjoint sequence of sets in R whose union, E, 
is also in R; then each £; is a finite, disjoint union of sets in P, 


E;= U; £5 and a(E) = Do; u(Ei). 


If E «P, then, since the class of all E;; is countable and disjoint, 
it follows from the countable additivity of » that 


M(E) = w(E) = Dis Diy w(Ey) = Ds a(E)). 
In the general case E is a finite, disjoint union of sets in P, 
E=U.h:, 
and, using the result just obtained, we have 
H(E) = De a(Fi) = De Ds w(E: NM) = 
= Di De w(E: NF) = Dial(E). | 


In view of Theorem E we shall, as we may without any possi- 
bility of confusion, write u(Z) instead of a(Z) even for sets E 
which are in R but not in P. 


(1) In the notation of the proof of Theorem D, let E,, be that term of the 
sequence { £,} whose left end point is the left end point of £; let E,, be the term 
whose left end point is the right end point of Z,,, and soon. It may be shown, 
without using Theorems A, B, and C, that 


Uti #n,eP and p(Uf1 2a) = ii n(F,). 


(2) An alternative proof of Theorem D (which does not use Theorems A, B, 
and C) proceeds by arranging the terms of the sequence {£;} in the order of 
magnitude of their left end points and then applying transfinite induction; 
cf. (1). 


(3) Let g be a finite, increasing, and continuous function of a real variable, 


and write 
Ue([a,4)) = g(4) — g(a). 
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Theorems D and E remain true if u is replaced by py. 
(4) Theorems D and E may be generalized to n-dimensional Euclidean space 
by considering “intervals” of the form 
E = {(1, +++, en): a; Sai <b, i= 1, +--+, nh, 
and defining » by 
w(E) = JT f-1 (6; — ai). 


(5) If wis a countably additive and non negative set function on a semiring 
P, such that u(0) = 0, then there is a unique measure ff on the ring R(P) such 
that, whenever Ee P, f(E) = u(E). If yp is [totally] finite or o-finite, then so 
is fi; (cf. 5.3 and the proof of Theorem E). 


§9. PROPERTIES OF MEASURES 


An extended real valued set function p on a class E is monotone 
if, whenever Ee E, Fe E, and ECF, then n(F) S w(F). An 
extended real valued set function » on a class E is subtractive 
if, whenever EcE, FeE, ECF, F —- Eck, and | u(E)| < ~, 


then 
u(F — E) = p(F) — p(E). 
Theorem A. Jf u is a measure on a ring R, then u is mono- 
tone and subtractive. 


Proof. IfE eR,F eR,andz# Cc F,thenF — Ec Randy(P) = 
h(E) + w(F — £). The fact that » is monotone follows now from 
the fact that it is non negative; the fact that it is subtractive 
follows from the fact that »(£), if it is finite, may be subtracted 
from both sides of the last written equation. J 


Theorem B. If u is a measure on a ring R, if E eR, and 
if {E,} is a finite or infinite sequence of sets in R such that 


Ec U;&,, then 
mM(E) S Dis w(Z). 


Proof. We make use of the elementary but important fact 
that if {F;} is any sequence of sets in a ring R, then there exists 
a disjoint sequence {G;} of sets in R such that 


G;c P; and U:G; = U: F.. 
(Write G; = F; — U {[Fj: 1 sj < i}.) Applying this result to 
the sequence {E N E;,}, the desired result follows from the count- 
able additivity and monotoneness of ». fj 
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Theorem C. Jf wis a measure ona ring R, if E eR, and if 
{E;} ts a finite or infinite disjoint sequence of sets in R such 
that U: E;,c E, then 


Di w(E;) S (EZ). 


Proof. If the sequence {£,} is finite, then J; EZ; eR, and it 
follows that 


Di H(E) = w(U: ED) S w(Z). 


The validity of the inequality for an infinite sequence of sets is a 
consequence of its validity for every finite subsequence. Jf 


Theorem D. If u is a measure on a ring R and if {E,} is 
an increasing sequence of sets in R for which lim, E, € R, then 
a(lim, E,) = lim, p(Z,). 


Proof. If we write Ey = 0, then 
u(lim, En) = o(Ui1 £) = o(Uis (Ei — Bin) = 
= Die mE; — Ex) = lim, Di-1 w(Ey — Eva) = 
= lim, (Ute (EZ; — Es-1)) = lim, w(Z,). | 


Theorem E. [fu is a measure on a ring R, and if {E,} is a 
decreasing sequence of sets in R of which at least one has finite 
measure and for which lim, E, eR, then u(lim, E,) = 
lim, B(E,). 


Proof. If u(En) <, then u(E,) S w(E,) < © for n 2m, 
and therefore u(lim, En) < ©. It follows from Theorems A 
and D, and the fact that {E,, — E,} is an increasing sequence, 
that 


(Em) — w(lim, Z,) = u(Z, — lim, E,) = 
= p(lima (Em — En)) = lima u(En — En) = 
= lima (u(En) — u(En)) = 
= p(En) — lim, u(E,). 


Since u(E,,) < ©, the proof of the theorem is complete. J 
We shall say that an extended real valued set function p de- 
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~fined on a class E is continuous from below at a set E (in E) if, 
for every increasing sequence {E,} of sets in E for which 
lim, E, = E, we have lima u(E,) = w(E). Similarly » is con- 
tinuous from above at E if, for every decreasing sequence { E,} 
of sets in E for which | u(Em) | < & for at least one value of m 
and for which lim, E, = E, we have lim, u(E,) = w(E). Theo- 
rems D and E assert that if u is a measure, then yu is continuous 
from above and from below (at every set in the ring of definition 
of »); the following result goes in the converse direction. 


Theorem F. Let u be a finite, non negative, and additive 
set function on aring R. If w is either continuous from below 
at every FE inR, or continuous from above at 0, then u is a measure 
on R. 


Proof. We observe first that the additivity of uw, together 
with the fact that R is a ring, implies, by mathematical induction, 
that u is finitely additive. Let {E,} be a disjoint sequence of 
sets in R, whose union, £, is also in R and write 


F,, = U?-1 E;, G, = E ioe F,. 


If 4 is continuous from below, then, since {F,} is an increasing 
sequence of sets in R with lim, F, = E, we have 


u(E) = lim, w(Fn) = lim, Die #(Es) = Dives w (Zi). 


If 4 is continuous from above at 0, then, since {G,} is a decreas- 
ing sequence of sets in R with lim, G, = 0, and since uy is finite, 
we have 


a(E) = (Sofa1 w(Es)) + u(Gr) = 
= lim, Dof-1 w(E,) + lim, u(Ga) = Doe w(Ey). | 
(1) Theorems A, B, C, D, and E are true for semirings in place of rings. 
The proofs may be carried out directly or they may be reduced to the correspond- 
ing results for rings by means of 8.5. 


(2) If u is a measure on a ring R, and if E and F are any two sets in R, then 


w(E) + w(K) = WE U F) + H(EN FP). 
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If E, F, and G are any three sets in R, then 


w(E) + u(F) + u(G) + HEN FNG) = 
=w(E U FUG) +H(EN F) +eH(F NG) +uGN Z). 


These statements may be generalized to any finite union. 

(3) If u is a measure on a ring R, and E£ and F are sets in R, we write E~ F 
whenever u(E A F) = 0. The relation “~” is reflexive, symmetric, and transi- 
tive; if E~ F, then w(E) = u(F) = w(E N F). Is the class of all those sets 
EinR for which E~ 0 a ring? 

(4) Continuing in the notation of (3), we write p(E,F) = w(E AF). Then 
p(E,F) 2 0, p(E,F) = (FE), and p(E,F) S (EG) + (GF). If Li~ Ee 
and F; ~ Fo, then p(F1,F 1) = p(Eo,F2). 

(5) The following generalizations of Theorems D and E are valid. If wis a 
measure on a ring R and if {Eq} is a sequence of sets in R for which 


Nien Z:eR, n=1,2,--- and liminf, Zn = URo1 hn EveR, 
then (lim inf, Z,) S lim inf, w(Z,). If, similarly, 
UinEieR, 2 =1,2,--+ and limsup, Z, = (\o-1 Ui. ZieR, 


and if u(Ui, £,) < © for at least one value of , then u(lim sup, En) = 
lim supa u(E,). 

(6) Under the hypotheses of the second part of (5), if >>? u(E,) < ©, then 
(lim sup, £,) = 0. 

(7) Let X be the set of all rational numbers x for which 0 S x S 1, and let 
P be the class of all “‘semiclosed intervals” of the form {x: xe X, a5 x < 4}, 
where 0 S a S 4S 1, anda and bare rational. Define u on P by 


B{x:¢ Sx <b}) =b-a., 


The set function y is finitely additive and continuous from above and below 
but it is not countably additive, so that Theorem F is not true for semirings in 
place of rings. 

(8) Let X be the set of all positive integers and let R be the class of all finite 
sets and their complements. For E in R write u(E) = 0 or w(EZ) = & according 
as E is finite or infinite. The set function p is continuous from above at 0 but 
it is not countably additive, so that the second half of Theorem F is not true 
if infinite values are admitted. 

(9) Is Theorem E true without the finiteness condition described in its 
statement? 

(10) If is a measure on the Borel sets of a separable, complete, metric space 
X such that w(X) = 1, then there exists a subset E of X such that £& is a count- 
able union of compact sets and such thatu(Z) = 1. (Hint: let {x,} be a sequence 
of points dense in X and write U,* for the closed sphere of radius i with center 
atxn If0 <e < land Fnt = U1 U;F, let m, be defined inductively as the 
smallest positive integer for which 


BNE Fm,*) > 1 — 
If C = ()fn1 Fm’, then C is compact and u(C) & 1 — e) 
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§10. oUTER MEASURES 


A non empty class E of sets is hereditary if, whenever E cE 
andF cE, then FekE. 

A typical example of a hereditary class is the class of all sub- 
sets of some subset E of X. The part of the algebraic theory of 
hereditary classes that we shall need is very easy and it is similar 
in every detail to the theories of rings, —-rings, and other classes 
of sets we have considered. It is, in particular, true that the 
intersection of every collection of hereditary classes is again a 
hereditary class, and that, therefore, corresponding to any class 
of sets, there is a smallest hereditary class containing it. We 
shall be especially interested in hereditary classes which are 
o-rings; it is easy to see that a hereditary class is a o-ring if and 
only if it is closed under the formation of countable unions. If 
E is any class of sets, we shall denote the hereditary o-ring 
generated by E, i.e. the smallest hereditary o-ring containing E, 
by H(E). The hereditary o-ring generated by E is, in fact, the 
class of all sets which can be covered by countably many sets in 
E; if E is anon empty class closed under the formation of countable 
unions (for instance if E is a o-ring), then H(E) is the class of all 
sets which are subsets of some set in E. 

An extended real valued set function u* defined on a class E 
of sets is subadditive if, whenever Ee E, Fe E, and EU F ek, 
then 

(EU F) S w*(E) + w*(F). 


An extended real valued set function p* on E is finitely subadditive 
if, for every finite class {E,, ---, E,} of sets in E whose union is 
also in E, we have 


u*(Uie1 BE) S Doh w*(E)). 


An extended real valued set function u* on E is countably syb- 
additive if, for every sequence {E;} of sets in E whose union is 
also in E, we have 


e*(Ui1 £) < D1 w*(E). 
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An outer measure is an extended real valued, non negative, mono- 
tone, and countably subadditive set function y*, defined on a 
hereditary o-ring H, and such that u*(0) = 0. We observe that 
an outer measure is necessarily finitely subadditive. The same 
terminology concerning [total] finiteness and o-finiteness is used 
for outer measures as for measures. 

Outer measures arise naturally in the attempt to extend meas- 
ures from rings to larger classes of sets. The first precise formula- 
tion of some of the details is contained in the following statement. 


Theorem A. If » is a measure on a ring R and if, for every 
E in H(R), 


u*(E) = inf {Diner w(Z,): Z, eR, 2 = 1,2, --+5 
Ec Us En}, 


then w* is an extension of w to an outer measure on H(R); if 
u 1s [totally] o—finite, then so is p*. 


Verbally 4*(EZ) may be described as the lower bound of sums 
of the type >>7_: u(E,), where {E,} is a sequence of sets in R 
whose union contains E. The outer measure y* is called the outer 
measure induced by the measure u. 

Proof. If EeR, then Ec EUQUOU--- and therefore 
u*(E) S w(E) + uO) + nO) +---= w(Z). On the other hand 
if EeR, E,eR,n = 1,2, ---, and Ec U7, E,, then, by 9.B, 
H(E) S DOr. u(E,), so that w(Z) S p*(E). This proves that 
n* is indeed an extension of y, ie. that if E eR, then p*(£) = 
p(E); it follows in particular that »*(0) = 0. 

If Ee H(R), Fe H(R), ECF, and {E,} is a sequence of sets 
in R which covers F, then {E,} also covers E, and therefore 
p*(E) S w*(P). 

To prove that u* is countably subadditive, suppose that E and 
E; are sets in H(R) such that E c Uj, E:; let ¢ be an arbitrary 
positive number, and choose, for each 7 = 1, 2, ---, a sequence 
{E,;} of sets in R such that 


E;c Uji Ej and Dj, u(Ey) S o*(E) + 


[Sec. 10} MEASURES AND OUTER MEASURES 43 


(The possibility of such a choice follows from the definition of 
u*(E;).) Then, since the sets E;; form a countable class of sets 
in R which covers E, 


u*(E) S Deter Die (Ey) S Die w*(Z) +e 
The arbitrariness of ¢ implies that 
p*(E) Ss int p*(E,). 


Suppose, finally, that u is o—finite and let E be any set in H(R). 
Then, by the definition of H(R), there exists a sequence {£;} 
of sets in R such that Ec Uj. £;. Since u is o-finite, there 
exists, for each i = 1, 2, ---, a sequence {£;;} of sets in R such 


that 
E;c Ufa Ey and p(Ey) <@. 


Consequently 
Ec Uri Ujer Ey and p*(Eyj) = w(Ey) <0. ff 


(1) Is it necessarily true, under the hypotheses of Theorem A, that if uw is 
finite, then so is p*? 

(2) For any class E of sets we denote by J(E) the smallest hereditary ring 
containing E. If yu is a real valued, finite, non negative, and finitely additive 
set function defined on a ring R, and if, for every E in J(R), 


u*(E) = inf {u(F): EC FeR}, 


then p* is a real valued, finite, non negative, and finitely subadditive set func- 
tion on J(R). Is it true that, for E in R, u*(E) = w(£)? 

(3) A necessary and sufficient condition that a class H of subsets of a set X 
be an ideal in the Boolean ring of all subsets of X is that H be a hereditary ring; 
cf. 4.4. 

(4) The following are some examples of set functions defined on hereditary 
o-rings; some of them are outer measures, while others violate exactly one of the 
defining conditions of outer measures. 

(4a) X is arbitrary, H is the class of all subsets of X. For any fixed point 
xo in X, write u*(E) = xe(xo). 

(4b) X and H are as in (4a); u*(E) = 1 for every E in H. 

(4c) X = {x,y} is a set consisting of exactly two distinct points x and 3, 
BH is the class of all subsets of X, and u* is defined by the relations 


n*(0) = 0, u*({x}) = w*({y}) = 10, w*(X) = 1. 


(4d) X is a set of 100 points arranged in a square array of 10 columns each 
with 10 points; H is the class of all subsets of X; 4*(Z) is the number of columns 
which contain at least one point of E. 
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(4e) X is the set of all positive integers, H is the class of all subsets of X. 
For every finite subset E of X, v(E) is the number of points in E; 


uX*(E) = lim supe a WE NM {1, «++, m}). 


(4f) X is arbitrary, H is the class of all countable subsets of X, u*(E) is the 
number of points in E, (= 0 if E is infinite). 

(5) If u* is an outer measure on a hereditary o-ring H and &p is any set in 
H, then the set function uo*, defined by wo*(E) = w*(E | Ep), is an outer meas- 
ure on H. 

(6) If A* and w* are outer measures on a hereditary o-ring H, then the set 
function v*, defined by »*(E) = A*(E) U y*(£), is an outer measure on H. 

(7) If {un*} is a sequence of outer measures on a hereditary o-ring H and 
{a,} is a sequence of positive numbers, then the set function u* defined by 
u*(E) = Dof1 ann *(E), is an outer measure on H. 


§ 11. MEASURABLE SETS 


Let »* be an outer measure on a hereditary o-ring H. A set E 
in H is #*-~measurable if, for every set 4 in H, 


B*(A) = w*(4 0 E) + u*(4 0 E+). 


The concept of u*-measurability is the most important one 
in the theory of outer measures. It is rather difficult to get an 
intuitive understanding of the meaning of »*~measurability ex- 
cept through familiarity with its implications, which we propose 
to develop below. The following comment may, however, be 
helpful. An outer measure is not necessarily a countably, nor 
even finitely, additive set function (cf. 10.4d). In an attempt to 
satisfy the reasonable requirement of additivity, we single out 
those sets which split every other set additively—the definition 
of u*-measurability is the precise formulation of this rather loose 
description. The greatest justification of this apparently compli- 
cated concept is, however, its possibly surprising but absolutely 
complete success as a tool in proving the important and useful 
extension theorem of § 13. 


Theorem A. Jf u* is an outer measure on a hereditary 
o-ring Hi and if § is the class of all w*-measurable sets, then 
S is a ring. 
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Proof. If E and F are in § and 4 eH, then 
(a) u*(4) = u*(4N E) + u*(4 N EL’), 
(b) w(AN FE) =w(ANENF)+u(ANEN FP), 
() w(ANE)=w(AN ENP) +utX(An ENF’. 
Substituting (b) and (c) into (a) we obtain 
(d) w(4) = w(ANENP)+HAN ENF) 
+H(AN ENF) +e(An EN PF’. 


If in equation (d) we replace 7 by 4 N (E U P), the first three 
terms of the right hand side remain unaltered and the last term 
drops out; we get 


(e) BAN (EUFP) =e(AN ENP) +u(AN ENF’) 
+yt(An EN F). 
Since E’ N F’ = (E U F)’, substituting (e) into (d) yields 
(f) u*(4) = e*(AN (EU F)) +y*(4N EU FP), 
which proves that E U Fe §. 
If, similarly, we replace 4 in equation (d) by 4 N (E — F)’ = 
AN (E’ U P), we get 
(:) u(AN E-F)) =e (AN ENF) +e*(AN EN F) 
+ u*(4N E' NF’). 
Since E N F’ = E — F, substituting (g) into (d) yields 
(hy w*(4) = (AN (E- F)) + u*(4 0 (E-F)), 
which proves that E—FeS. Since it is clear that E = 0 
satisfies (a), it follows that Sis a ring. J 
Before proceeding with the study of the deeper properties of 
u*-measurability, we remark on the following elementary but 
frequently useful fact. 


If u* is an outer measure on a hereditary o-ring H. and if 
a set E in H is such that, for every 4 in H, 


u*(4) = w*(AN E) + u*(ANn £), 


then E is u*-measurable. 
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The proof of this remark is achieved simply by recalling that the 
reverse inequality, u*(4) S u*(4N E) + w*(4 N E’), is an auto- 
matic consequence of the subadditivity of u*. 


Theorem B. Jf u* is an outer measure on a hereditary 
o-ring H_ and if § is the class of all u*-measurable sets, thea § 
is ao-ring. If A eH and if {E,,} is a disjoint sequence of sets 
in S with U2. En = E, then 


B*(A NE) = Diver u*(4 2 E,). 


Proof. Replacing E and F in (e) by £, and Eg respectively, 
we see that 


u*(4 0 (A, U E2)) = w*(4 N E) + w*(4 2 Ey). 
It follows by mathematical induction that 
a*(4N Ui BE) = Dieru*(4 N E,) 
for every positive integer n. If we write 
F, = U?.1 EB; n=1,2,---, 
then it follows from Theorem A that 
w*(A) =e (AN Fi) +e(4N F,) 2 
2 Dierw*(4N E,) + u*(4 Nn LE). 
Since this is true for every n, we obtain 
(i) u*(4) = Diwt(AN ED +er(AN EZ 
2u*(AN E)+u*(4n £’. 


{t follows that E «eS (so that, by the way, S is closed under the 
formation of disjoint countable unions), and therefore that 


G) Dimiwt*(4N E) +u*(4N £’) = 

= u(4 NE) + p*(A Nn E’). 
Replacing 4 by 4 E in (j), we obtain the second assertion of 
the theorem. (Since p*(4 N E’) may be infinite, it is not permis- 
sible simply to subtract it from both sides of (j).) Since every 


countable union of sets in a ring may be written as a disjoint 
countable union of sets in the ring, we see also that Sis ao-ring. JJ 
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Theorem C. Jf u* is an outer measure on a hereditary 
o-ring H. and if § is the class of all »*-measurable sets, then 
every set of outer measure zero belongs to S and the set function 
fi, defined for E in § by ji(E) = w*(E), is a complete measure 
on 8. 


The measure fi is called the measure induced by the outer 


measure y*. 
Proof. If E «H and u*(£) = 0, then, for every 4 in H, we 
have 


u*(4) = p*(E) + u*(4) 2 (ANE) + u*(4N £), 


so that indeed E eS. The fact that g is countably additive on § 
follows from (j) upon replacing 4 by Z. If 


EeS, FCE, and (2) = u*(E) =0, 
then u*(F) = 0, so that F e¢ §, which proves that gis complete. ff 


(1) For the example 10.4d, a set E is u*-measurable if and only if with every 
point x in E the entire column which includes x is contained in Z. Which sets 
are w*-measurable in 10.4f? 

(2) If »* is an outer measure on a hereditary o-ring H, under what addi- 
tional conditions is the class of 1*-measurable sets an algebra? 

(3) In the notation of Theorem A, replacing 4 in equation (d) by 4N 
(E’ U F’) may be used to give a direct proof of the fact that S is closed under the 
formation of intersections. What would the same technique prove if 4 were 
replaced by AN (F— EY’ =AN(EU PY? 

(4) Let u* be a finite, non negative, monotone, and finitely subadditive set 
function with »* (0) = 0 on a hereditary ring J; cf. 10.2. The class of all u*- 
measurable sets is a ring, and the set function u* is additive on this ring. 

(3) Suppose that u* is an outer measure on a hereditary o-ring Hf and that 

is the class of all_u*-measurable sets. If 4¢H and {Ep} is an increasing 
sequence of sets in S, then u*(lim, (4 M En)) = lim, p*(4 E,). Similarly, 
if LE} is a decreasing sequence of sets in S, and if a set 4 in Hi is such that 

u*(4 1 Em) <0 for at least one value of m, then u*(lim, (41M En)) = 
lim» ad n E,). 

(6) If 4* is an outer measure on a hereditary o-ring H and if E and F are two 

sets in H of which at least one is 4*-measurable, then (cf. 9.2) 


BME) +y*(F) = u8(E UF) +u*(EN FP). 


(7) The results of this section could also have been obtained by means of 
partitions (cf. 7.5). A partition is a finite or infinite disjoint sequence {E.} of 
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sets such that UJ, Z; = X. If u* is an outer measure on a hereditary o-ring 
H, the partition {£,} is called a »p*=partition if 
u*(A4) = DiuX(4N £) 

for every 4 in H; a set E is a p.*-set if the partition {Z,E’} is a u*-partition. 
If {E;} and {F;} are partitions, then {£,} is called a subpartition of {F;} if 
each E; is contained in one of the sets F;. The product of two partitions, {£,} 
and {F;}, is the partition consisting of all sets of the form E; M F;. We note 
that a set in Hl is a u*-set if and only if it is u*~measurable in the sense of this 
section. 

(7a) The product of two 4*-partitions is a y*-partition. 

(7b) If a subpartition of a partition {£;} is a u*-partition, then {£,} is a 
u*-partition. 

(7c) A partition {E;} is a u*-partition if and only if each E; is a p*-set. 

(7d) The class of all 4*-sets is a o-ring. (Hint: the class of all u*-sets is a 
ring closed under the formation of countable disjoint unions.) 

(8a) An outer measure p* on the class Hl of all subsets of a metric space X 
is a metric outer measure if 


B(E U F) = w*(E) + u*(F) 
whenever p(Z,F) > 0, where p is the metric on X. If * is a metric outer meas- 
ure, if E is a subset of an open set U in X, and if E, = EN {= p(x,U’) = +} . 


n= 1,2, ---, then lim, u*(Z,) = u*(EZ). (Hint: observe that {Z,} is an in- 
creasing sequence of sets whose union is E. If Eo = 0, Da = Engi — En, and 
if neither D,41 nor Ey is empty, then p(Dn41,En) > 0, and it follows that 


u*(Eong1) 2 DP w*(Dax) and p*(Eon) 2 DF. e*(Day-2). 
The desired conclusion is trivial if either of the two series, 
Dieiv*(De:) and Shi e*(Da-1), 
diverges; if they both converge, then it follows from the relation 
BY(E) S w*(Esn) + Den w* (Dai) + Ding e*(Dai-1).) 


(8b) If 4* is a metric outer measure, then every open set (and therefore 
every Borel! set) is u*—measurable. (Hint: if U is an open set and 4 is an arbi- 
trary subset of X, apply (8a) to E= AN U. Since p(En,4 N U"’) > 0, it 
follows that 


u*(4) 2 u*(En U (4 UV’) = w*(Es) + u*(4 N U),) 


(8c) If u* is an outer measure on the class of all subsets of a metric space X 
such that every open set is u*—measurable, then u* is a metric outer measure. 
(Hint: if p(Z,F) > 0, let U be an open set such that EC Uand FN U=9Q, 
and test the u*-measurability of U with 4 = E U F.) 


Chapter [I 


EXTENSION OF MEASURES 


§12. PROPERTIES OF INDUCED MEASURES 


We have seen that a measure induces an outer measure and 
that an outer measure induces a measure, both in a certain natural 
way. If we start with a measure yu, form the induced outer meas 
ure »*, and then form the measure @ induced by u*, what is the 
relation between » and 7? The main purpose of the present sec- 
tion is to answer this question. Throughout this section we shall 
assume that 


p is a measure on a ring R, yp* is the induced outer measure 
on H(R), and i is the measure induced by yu* on the o-ring 
S of all u*-measurable sets. 


Theorem A. Every set in S(R) is u*-measurable. 


Proof. If Ee R, 4 2 H(R), and e > 0, then, by the definition 
of u*, there exists a sequence {Z,} of sets in R such that 
Ac Us: En and 


u*(A) +¢2 Diner m(En) = Liner (u(En NE) + w(E, N E£’)) 2 
2u*(4n E)+ w*(4n EL’). 


Since this is true for every e, it follows that E is y*-measurable. 
In other words, we have proved that R c §; it follows from the 
fact that S is ao-ring that S(R) CS. J 
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Theorem B. Jf E « H(R), then 
p*(E) = inf {a(F): EC F eS} = 
= inf {a(F): EC FeS)}. 


Equivalent to the statement of Theorem B is the assertion 
that the outer measure induced by g on S(R) and the outer meas- 
ure induced by f on S both coincide with p*. 

Proof. Since, for F in R, u(F) = a(F) (by the definition of 
i and 10.A), it follows that 


w*(E) = inf {S071 u(En): EC Ur. En, E, eR, 
ails 2, soe} = 
= inf {7.1 a(Z,): EC Uri £,, Ene S(R), 
n=1,2,---}. 


Since every sequence {E,} of sets in S(R) for which 
EcU;.14, =F 


may be replaced by a disjoint sequence with the same property, 
without increasing the sum of the measures of the terms of the 
sequence, and since, by the definition of g, a(F) = u*(F) for F 
in §, it follows that 


p*(E) 2 inf {a(F): EC FeSR)} 2 
= inf {a(F): ECF eS} 2u*(Z). | 


If EF e H(R) and F e S(R), we shall say that F is a measurable 
cover of E if Ec F and if, for every set G in S(R) for which 
GcF— E, we have f(G) = 0. Loosely speaking, a measurable 
cover of a set E in H(R) is a minimal set in S(R) which covers E. 


Theorem C. If a@ set Ein H(R) is of o—finite outer measure, 
then there exists a set F in S(R) such that w*(E) = a(F) and 
such that F is a measurable cover of E. 


Proof. If u*(E) = ~, and Ec F « S(R), then clearly a(F) = 
o, so that it is sufficient to prove the assertion p*(Z) = a(F) 
only in the case in which p*(E) < @. Since a set of o-finite 
outer measure is a countable disjoint union of sets of finite outer 
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measure, it is sufficient to prove the entire theorem under the 
added assumption that p*(E) < ©, 

It follows from Theorem B that, for every nm = 1, 2, ---, there 
exists a set F,, in S(R) such that 


EcF, and A(F,) S w*(E) + <. 
If we write F = ()%_, Fn, then 
EcFeS(R) and p*(E) S$ a(F) S$ a(F,) S$ uX(2 + - 


Since x is arbitrary, it follows that u*(Z) = a(F). If GeS(R) 
and Gc F — E, then E Cc F — G and therefore 


a(F) = w*(Z) Ss (F — G) = a) — AG) S AF); 
the fact that F is a measurable cover of E follows from the finite- 
ness of i(F). §f 
Theorem D. Jf Ee H(R) and F is a measurable cover of 

E, then p*(E) = a(F); if both F, and F, are measurable covers 

of E, then i(F, A F,) = 0. 

Proof. Since the relation EC F,N F.C F; implies that 
Fi -— (Fi N F:) c F, — E, it follows from the fact that F, is a 
measurable cover of E that 

a(F, — (Fi N F)) = 0. 
Since, similarly, 

a(F2 — (F, N F2)) = 0, 
we have, indeed, a(F; A F,) = 0. 

If u*(E) = «, then the relation u*(E) = a(F) is trivial; if 
h*(E) < &, then it follows from Theorem C that there exists a 
measurable cover Fo of E with 


(Fo) = u*(£). 


Since the result of the preceding paragraph implies that every 
two measurable covers have the same measure, the proof of the 
theorem is complete. §f 


Theorem E. Jf u on R is o—finite, then so are the measures 
fi on S(R) and ji on §. 
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Proof. According to 10.A, if «is o—-finite, then so is u*. Hence 
for every E in S there exists a sequence {F,} of sets in H(R) 
such that 


Ec Ui, Ei, p*(E;) < om, i= 1, 2s es 


An application of Theorem C to each set E; concludes the proof 
of the theorem. J 

The main question at the beginning of this section could have 
been asked in the other direction. Suppose that we start with 
an outer measure yu*, form the induced measure j, and then form 
the outer measure ji* induced by ~ What is the relation between 
p* and ~*? In general these two set functions are not the same; 
if, however, the induced outer measure f* does coincide with the 
original outer measure u*, then u* is called regular. The asser- 
tion of Theorem B is exactly that the outer measure induced by a 
measure on a ring is always regular. The converse of this last 
statement is also true: if u* is regular, then n* = j* is induced by 
a measure on a ring, namely by 4 on the class of u*-measurable 
sets. Thus the notions of induced outer measure and regular outer 
measure are coextensive. 


(1) Theorem D asserts that a measurable cover is uniquely determined to 
within a set of measure zero, if it exists at all; Theorem C asserts that for sets 
of o-finite outer measure a measurable cover does exist. The following con- 
siderations show that the hypothesis of o-finiteness cannot be omitted from 
Theorem C. 

If Z is a line in the Euclidean plane X, and E is any subset of X, we shall 
say that EZ is fullon Lif L — Eis countable. Let Ro be the class of all those sets 
E which may be covered by countably many horizontal lines on each of which 
E is either full or countable; let R be the algebra generated by Ro; (cf. 4.5). 
If, for every E in R, u(Z) = 0 or & according as E is countable or not, then » 
is a measure on R; it is easy to verify that in this case R = S(R) and S = H(R) 
is the class of all subsets of X. If E is the y -axis and E C Fe S(R), then there 
always exists a set G in S(R) such that G C F — E and u(G) ¥ 0. 

(2) A subset E of the real line is said to have an infinite condensation point 
if there are uncountably many points of £ outside every finite interval. Let 
X be the real line and define a set function u* on every subset E of X as follows: 
if E is finite or countably infinite, then u*(£) = 0; if EZ is uncountable but does 
not have an infinite condensation point, then w*(£) = 1; if E has an infinite 
condensation point, then u*(E) = ©. Then p* is a totally o-finite outer meas- 
ure, but, since the only u*-measurable sets are the countable sets and their 
complements, the induced measure ff is not o-finite. Is u* regular? What can 
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be said if, instead, »*(E) is defined to be 17 whenever E has an infinite con- 
densation point? 

(3) Let 2 be a fixed positive integer, and let No, Ni, °--, Nn be the first 2 ++ 1 
infinite cardinal numbers in the well ordering of the cardinals according to 
magnitude. If X is a set of cardinal number Nn, and £ is a finite subset of X, 
write u*(E) = 0; if the cardinal number of a subset E of X is the infinite cardinal 
N.,0 Sk Sn, write u*(E) = k. The set function y* is an outer measure; is it 
regular? 

(4) Ifu* is a regular outer measure on a hereditary o-ring H, and if {Z,} is an 
increasing sequence of sets in H with lim, EZ, = E, then p*(Z) = limau*(E,). 
(Hint: if lima w*(E,) = %, the result is clear. If not, then let F, be a u*—meas- 
urable cover of En, n = 1, 2, --+, so that the sequence {F,} is increasing, and 
write F = lim, Fn. Since u*(Fr) = w*(En) S u*(E), we have lima u*(Fn) = 
u*(F) S u*(E); since E C F, u*(Z) S u*(F). Hence F is a measurable cover 
of E.) This result is not true for non regular outer measures; a counter example 
may be constructed on the basis of (2) above. 

(5) For every subset E of an arbitrary set X write u*(E) = 0 or 1 according 
as E is empty or not; the set function p* is a regular outer measure on the class 
of all subsets of X. If {E,} is a decreasing sequence of non empty sets with an 
empty intersection (such a sequence exists whenever X is infinite), then 


lim, w*(En) = 1 and yu*(lim, Ey) = 0; 


in other words the analog of (4) for decreasing sequences is false even for totally 
finite, regular outer measures. 

(6) Let y1* and pe* be two finite outer measures on the class of all subsets 
of a set X, and let §,, i = 1, 2, be the class of u;*-measurable sets. If, for all 


subsets E of X, 
BYE) = wi *(Z) + w2*(Z), 


then the class § of all 4*-measurable sets is the intersection of 8; and Se. (Hint: 
if w*(4 NE) +u*(4 2 E’) = w*(A), then both the inequalities, u:*(4 M E) 
+yi*(4 N E’) = u,*(A), i = 1, 2, must become equalities.) What can be said 
if u,* and pe* are not necessarily finite? 

(7) Let py* be any finite, regular outer measure on the class of all subsets of a 
set X, and write ue*(Z) = 0 or | according as E is empty or not. Then pe* 
is also a finite, regular outer measure, but, if #1* assumes more than two values, 
then ui* + we* is not regular. 

(8) If X is a metric space, p is a positive real number, and E£ is a subset of X, 
then oe p-dimensional Hausdorff (outer) measure of E is defined to be the 
number 


Mp*(E) = sup.>oinf {001 (2): E = URE, 6(£) <6 i= 1,2,-°-}, 


where 5(Z) denotes the diameter of F. 

(8a) The set function yp* is a metric outer measure; cf. 11.8a. 

(8b) The outer measure py* is regular; in fact, for every subset E of X, 
there exists a decreasing sequence {U,} of open sets containing E such that 


Me*(Z) baa Bp*(s=1 Ua). 
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§ 13. EXTENSION, COMPLETION, AND APPROXIMATION 


Can we always extend a measure on a ring to the generated 
o-ring? The answer to this question is essentially contained in 
the results of the preceding sections; it is formally summarized 
in the following theorem. 


Theorem A. [f u is a o—finite measure on a ring R, then 
there is a unique measure ji on the o-ring S(R) such that, for E 
in R, i(E) = w(E); the measure ps is o-finite. 


The measure fi is called the extension of yu; except when it is 
likely to lead to confusion, we shall write u(Z) instead of g(Z) 
even for sets E in S(R). 

Proof. The existence of ~ (even without the restriction of 
o-finiteness) is proved by 11.C and 12.A. To prove uniqueness, 
suppose that mw, and we are two measures on S(R) such that 
ui(E) = u2(E) whenever E eR, and let M be the class of all sets 
E in S(R) for which »,(Z) = u2(E). If one of the two measures 
is finite, and if {E,} is a monotone sequence of sets in M, then, 


since 
ui(lim, Z,) = limau(E,), i = 1,2, 


we have lim, E, eM. (The full justification of this step in the 
reasoning makes use of the fact that one of the two numbers 
m,(E,) and po(E,), and therefore also the other one, is finite for 
every n = 1,2, ---; cf.9.D and 9.E.) Since this means that M 
is a monotone class, and since M contains R, it follows from 
6.B that M contains S(R). 

In the general, not necessarily finite, case we proceed as follows. 
Let 4 be any fixed set in R, of finite measure with respect to one 
of the two measures p; and py. SinceR N Aisaring and S(R)N 4 
is the o-ring it generates (cf. 5.E), it follows that the reasoning 
of the preceding paragraph applies to RN 4 and S(R)N 4, 
and proves that if Ee S(R)N 4, then pi(E) = we(F). Since 
every E in S(R) may be covered by a countable, disjoint union 
of sets of finite measure in R (with respect to either of the meas- 
ures 4, and po), the proof of the theorem is complete. Jf 

The extension procedure employed in the proofs of § 12 yields 
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slightly more than Theorem A states; the given measure » can 
actually be extended to a class (the class of all «*-measurable 
sets) which is in general larger than the generated o-ring. The 
following theorems show that it is not necessary to make use of 
the theory of outer measures in order to obtain this slight enlarge- 
ment of the domain of p. 


Theorem B. If u is a measure on a o-ring S, then the class 
S of all sets of the form EAN, where E eS and N is a subset 
of a set of measure zero in S, is a o-ring, and the set function ji 
defined by i(E AN) = p(E) is a complete measure on §. 


The measure fi is called the completion of u. 
Proof. If EeS,NcAeS, and u(4) = 0, then the relations 


EUN=(E-AMA[AN(EUN) 
and 


EAN =(E—A4)U[4N (EAN)] 


show that the class § may also be described as the class of all 
sets of the form E U N, where E ¢S and N is a subset of a set 
of measure zero in S. Since this implies that the class §, which 
is obviously closed under the formation of symmetric differences, 
is closed also under the formation of countable unions, it follows 
that Sis ao-ring. If 


Fy AN, = FF, A No, 


where E; eS and JN; is a subset of a set of measure zero in S, 
i = 1,2, then 
FE, A Ee = N, A No, 


and therefore u(E, A E2) = 0. It follows that u(E,) = u(Es), 
and hence that is indeed unambiguously defined by the relations 


H(E AN) = aE U N) = w(Z). 


Using the union (instead of the symmetric difference) representa- 
tion of sets in §, it is easy to verify that @ is a measure; the 
completeness of ji is an immediate consequence of the fact that 
5 contains all subsets of sets of measure zero in S. § 
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The following theorem establishes the connection between the 
general concept of completion and the particular complete exten- 
sion obtained by using outer measures. 


Theorem C. J[f u is a o—finite measure on a ring R, and if 
u* is the outer measure induced by p, then the completion of the 
extension of » to S(R) is identical with p* on the class of all 
p*—measurable sets. 


Proof. Let us denote the class of all u*-measurable sets by 
S* and the domain of the completion # of » by §. Since u* on 
S* is a complete measure, it follows that § is contained in S* 
and that @ and u* coincide on §. All that we have left to prove 
is that S* is contained in §; in view of the o-finiteness of u* on 
S* (cf. 12.E) it is sufficient to prove that if E eS* and p*(E) < ~, 
then Ee S 

By 12.C, Z has a measurable cover F. Since u*(F) = u(F) = 

u*(E), it follows from the finiteness of w*(E), and the fact that 
u* is a measure on S*, that u*(F — FE) = 0. Since F — E also 
has a measurable cover G, and since 


u(G) = uM(F - B) = 
the relation 
= (F—~G)U (ENG) 


exhibits E as a union of a set in S(R) and a set which is a subset 
of a set of measure zero in S(R). This shows that E e §, and thus 
completes the proof of Theorem C. § 

Loosely speaking, Theorem C says that in the o-finite case the 
o-ring of all u*-measurable sets and the generated o-ring S(R) 
are not very different; every u*—measurable set suitably modified 
by a set of measure zero belongs to S(R). 

We conclude this section with a very useful result concerning 
the relation between a measure on a ring and its extension to the 
generated o-ring. 


Theorem D. If p is a o—finite measure on a ring R, then, 
for every set E of finite measure in S(R) and for every positive 
number ¢, there exists a set Eg in R such that p(EA Ey) Se 
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Proof. The results of §§ 10, 11, and 12, together with Theorem 
A, imply that 


u(E) = inf {v1 u(E,): Ec Uni E,, E; eR, i= 1, 2; veel, 


Consequently there exists a sequence {£;} of sets in R such that 


Ec Uz, E; and w(UF1 E;) S w(£) +5 = 


Since 
lima w(Ui.1 Ed) = o(UF1 ED, 


there exists a positive integer ” such that if 


Eo = Uie1 Ei, 
then 


u(Ui-1 £) S u(Eo) + 


Clearly Ey ¢ R; since 


w(E ~ Eo) S$ u(UP1 Es — Eo) = o(U1 E)) — uF) S ; 
and 
u(Eo — E) S w(Uis Bi -— E) = (UP B) - wl) $5, 


the proof of the theorem is complete. J 


(1) Let u be a finite, non negative, and finitely additive set function defined 
onaring R. The function y* defined by the procedure of § 10 is still an outer 
measure, and, therefore, the @ of 11.C may still be formed, but it is no longer 
necessarily true that Z is an extension of us (cf. 10.2, 10.4e, and 11.4). 

(2) If & is the extension of the measure mw on the ring R described in § 8, 
then, for any countable set E, Ee S(R) and A(E) = 0. 

(3) The uniqueness assertion of Theorem A is not true if the class R is not a 
ring. (Hint: let X = {a,d,c,d} be a space of four points and define the measures 
(41 and pz on the class of all subsets of X by 


wai({a}) = wi({d}) = we({d}) = we({c}) = 1, 
wi({S}) = wi({c}) = wol{a}) = we({d}) = 2.) 


(4) Is Theorem A true for semirings instead of rings? 

(5) Let R be a ring of subsets of a countable set X, with the property that 
every non empty set in R is infinite and such that S(R) is the class of all subsets 
of X; (cf. 9.7). If, for every subset E of X, u,(£) is the number of points in E 
and wo(E) = 2u1(E), then we and py agree on R but not on S(R). In other words, 
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the uniqueness assertion of Theorem A is not true without the restriction of 
o-finiteness on R, even for measures which are totally o—finite on S(R). 

(6) Suppose that p is a measure on a o-ring S and that fi on S is its completion. 
If 4 and B are inS and if 4 C EC B, and u(B — A) = 0, then Ee S. 

(7) Let X be an uncountable set, let S be the class of all countable sets and 
their complements, and, for every E in S, let w(EZ) be the number of points in 
E. Then p is a complete measure on S, but every subset of X is u*-measurable; 
in other words, Theorem C is false without the assumption of o-finiteness. 

(8) If w and v are o—finite measures on a ring R, then, for every E in S(R) 
for which both p(£) and »(£) are finite and for every positive number ¢, there 
exists a set Eg in R such that 


W(EAE:)) Se and w(EAF) Se 


§14. INNER MEASURES 


We return now to the general study of measures, outer measures, 
and the relations among them, in order to describe an interesting 
and historically important part of the theory. 

We have seen that if u is a measure on a o-ring S, then the set 
function u* (defined for every E in the hereditary o-ring H(S) by 


u*(E) = inf {u(F): E Cc F ce S}) 


is an outer measure; in the o-finite case the induced measure 
on the o-ring § of all u*-measurable sets is the completion of u. 
Analogously we now define the inner measure ys induced by yp; 
for every E in H(S) we write 


px(E) = sup {u(F): EDFe$}. 


In this section we shall study yu» and its relation to u*; we shall 
show that the properties of us are in a very legitimate sense the 
duals of those of u*. It is very easy to see that the set function 
Mx is non negative, monotone, and such that ux(0) = 0; in what 
follows we shall make use of these elementary facts without any 
reference. Throughout this section we shall assume that 


pis a o-finite measure on a o-ring S, u* and pws are the outer 
measure and the inner measure induced by uy, respectively, 
and ji on §S is the completion of pu; 


we recall that g on S coincides with u* on the class of all «*-meas- 


urable sets (cf. 13.C). 


tn 
'S 
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Theorem A. Jf E « H(S), then 
ux(E) = sup {a(F): ED Fc}. 
Proof. Since S c§, it is clear from the definition of ys that 
ux(E) S sup {a(F): ED Fe 8}. 


On the other hand 13.B implies that, for every F in §, there is a 
GinS withG c Fand f(F) = u(G). Since this means that every 
value of ~ on subsets of E in § is also attained by yu on subsets of 
Ein §, the proof is complete. J 

If Ee H(S) and F eS, we shall say that F is a measurable 
kernel of E if Fc E and if, for every set G in S for which 
Gc E — F, we have u(G) = 0. Loosely speaking a measurable 
kernel of a set E in H(S) is a maximal set in S which is contained 
in E. 

Theorem B. Every set E in H(S) has a measurable kernel. 


Proof. Let £ bea measurable cover of E, let N be a measurable 
cover of £ — E, and write F = E— N. We have 


F=E-NcE-(E-EB)=E, 
and, if G Cc E — F, then 
GcE-(E-N=ENNCN-(E-B®). 


It follows (since N is a measurable cover of E — E), that F is a 
measurable kernel of E. Jj 


Theorem C. Jf Ee H(S) and F is a measurable kernel of 
E, then w(F) = px(E); if both F, and Fy are measurable 
kernels of E, then w(F, A F,) = 0. 


Proof. Since Fc E, it is clear that u(F) S wsa(E). If u(F) < 
pa(E), then u(F) is finite and, by the definition of ys(#), there 
exists a set Fy in S such that Fo C E and u(Fo) > u(F). Since 


Fo -FCE-—F and p(Fo — F) 2 u(Fo) — u(F) > 0, 


this is a contradiction, and therefore indeed u(F) = px(E). 
Since the relation F, Cc F, U F, C E implies that (F; U F) — 
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F, c E — F,, it follows from the fact that F, is a measurable 
kernel of E that 


u((F, U PF.) — F,) = 0. 
Since, similarly, 

u((F, U Fy) — Fy) = 0, 
we have p(Fi AF.) = 0. J 


Theorem D. Jf {E,.} is a disjoint sequence of sets in H(S), 
then 
ba (Urer E,) 2 are bs (E,). 


Proof. If F, is a measurable kernel of E,, 1 =1, 2, ++: 
then the countable additivity of u implies that 


Diver e(En) = Done w(Fa) = w(Ur-1 Fa) S we(UR-1 En). Ol 


Theorem E. J[f 4 H(S) and if {E,} is a disjoint sequence 
of sets in 8 with U2, En = E, then 


x(4 1 E) = Diner ua(4 N E,). 
Proof. If F is a measurable kernel of 4 N E, then 
us(4 0 E) = w(F) = Doe a(F ON Ex) S OPN oe (4 0 E,); 
the desired result follows from Theorem D. J 
Theorem F. If Ec §, then 
u*(E) = us (E) = (2), 
and, conversely, if E e H(S) and 


u*(E) = px(E) < ~, 
then E eS. 


Proof. If ES, then both the supremum in Theorem A and 
the infimum in 12.B are attained by g@(£). To prove the converse, 
let 4 and B be a measurable kernel and a measurable cover of E, 
respectively. Since u(4) = ux(E) < 0, we have 


u(B — A) = w(B) — (4) = w*(E) — ue (2) = 0, 
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and the desired conclusion follows from the completeness of @ 
on §; (cf. 11.C and 13.6). J 


Theorem G. Jf E and F are disjoint sets in H(S), then 
ba(E U F) S ue (E) + w*(F) S u*(E U F). 


Proof. Let 4 be a measurable cover of F and let B be a meas- 
urable kernel of E U F. Since B — 4c E, it follows that 


ua(E U F) = p(B) S o(B — 4) + w(4) S us(E) + *(F). 


Dually, let 4 be a measurable kernel of E and let B be a meas- 
urable cover of EU F. Since B — 4 2 F, it follows that 


w*(E U F) = w(B) = oA) + w(B — 4) 2 oe (EF) + (PF). IY 
Theorem H. Jf E &§, then, for every subset A of X, 
bx(4 N E) + u*(4’  E) = ai(Z). 
Proof. Applying Theorem G to 4 N Eand 4’ N E, we obtain 
Ha(E) S ux(4N E) + *(4'N EZ) S w*(£). 


Since E eS, we have, by Theorem F, ps(E) = u*(E) = a(E). | 

The results of this section enable us to sketch the steps of an 
alternative approach to the extension theorem, an approach that 
is frequently employed. If u is a o—finite measure on a ring R, 
and if u* is the induced outer measure on H(R), then, for every 
set E in R with p(E) < ~ and for every 4 in H(R), we have 


Ha(4 M E) = p(E) — p*(a’ NE). 


If we prove now that whenever £ and F are two sets of finite 
measure in R for which 4N E = ANF, then it follows that 
u(E) — p*(4’ N E) = w(F) — w*(d’ N FP), then we may use the 
equation for ux(4 NM E) as a definition of inner measure, and we 
may define a set E in H(R) of finite outer measure to be u*—meas- 
urable if and only if wx(Z) = w*(Z). The details of this procedure 
may be easily carried out by the interested reader, using the 
techniques we have introduced in our development of the exten- 
sion theory. 
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(1) Do the results of 12.4 remain true if 4* is replaced by us? 

(2) With suitable finiteness restrictions the dual of 12.4 is true for inner 
measures, but the unaltered result of 12.4 is not; (cf. 12.5). 

(3) If E is a set of finite measure in S, if Fc E, and if @(E) = u*(F) + 

p*(E — F), then Fe S. In other words the B *measurability of F may be 
tested by employing a fixed set E (containing F) in § instead of an arbitrary 4 
in H(S). (Hint: use Theorem H.) 

(4) Is an analog of 11.6 true for inner measures? 

(5) If Ee H(S) and F is a measurable cover of E, then, for every u*-measur- 
able set M,a(F N M) = n*(EN M). (Hint: apply TheoremHtoE = FNM 
and 4 = E’.) Conversely, any set F with this property andsuch that EC FeS 
is a measurable cover of E. Similarly, F is a measurable kernel of E if and only 
if ED FeSanda(F N M) = us(E N M) for every Min §. 


§ 15. LEBESGUE MEASURE 


The purpose of this section is to apply the general extension 
theory to the special measure discussed in § 8, and to introduce 
some of the classical results and terminology pertinent to this 
special case. Throughout this section we shall assume that 


X is the real line, P is the class of all bounded, semiclosed 
intervals of the form [a,4), S is the o-ring generated by P, 
and yu is the set function on P defined by u({2,4)) = b — a. 


The sets of the o-ring S are called the Borel sets of the line; 
according to the extension theorems 8.E and 13.A, we may assume 
that u is defined for all Borel sets. If f on § is the completion 
of » on §, the sets of S are the Lebesgue measurable sets of the 
line; the measure ~ is Lebesgue measure. (The incomplete 
measure pv on the class S of all Borel sets is usually called Lebesgue 
measure also.) 

Since the entire line X is the union of countably many sets in 
P, we see that X eS, so that the o-rings S and § are even 
o-algebras. Since clearly u(X) = ©, uw is not finite on S, but, 
since pu is finite on P, both » on S and fi on § are totally o—finite. 
Some of the other interesting properties of » and @ are contained 
in the following theorems. 


Theorem A. Every countable set is a Borel set of measure 
zero. 
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Proof. For any a4, —~ < a < ~, we have 


{a} = {x:~x =a} = ()F-1 frase<at-, 
and therefore 


u({a}) = lim, u(|ae + -)) = lim, — = 0, 


so that every one-point set is a Borel set of measure zero. Since 
the Borel sets form a o-ring and since p is countably additive, 
the theorem follows. § 
Theorem B. The class S of all Borel sets coincides with 
the o-ring generated by the class U of all open sets. 

Proof. Since, for every real number a, the set {a} is a Borel 
set, it follows from the relation (a,b) = [a,b) — {a}, that every 
bounded open interval is a Borel set. Since every open set on 
the line is a countable union of bounded open intervals, it follows 
that S>U and consequently that S>S(U). To prove the 
reverse inequality, we observe that, for every real number a, 


so that {a} eS(U). It follows from the relation [2,b) = (a,b) U 
{a} that P c S(U) and consequently that 


S=S@®)cS). J 


Theorem C. Jf U is the class of all open sets, then, for 
every E in X, 


p*(E) = inf {u(U): Ec UeV}. 


Proof. Since y*(E) = inf {p(F): E c F eS}, it follows from 
the fact that U c S that 


e*(E) S inf {u(U): Ec UecU}. 


If, on the other hand, ¢ is any positive number, then it follows 
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from the definition of u* that there exists a sequence {[an,b,)} 
of sets in P such that 


EC User [anbn) and S221 (bn — an) S w*(E) + 7 


Consequently 
+ € 
Ec Uses (on — 555 on) = UctI, 


and 


w(U) S Doras (bn — an) + 5 S wt(kE) +. 


The desired result follows from the arbitrariness of «. Jf 


Theorem D. Let T be the one to one transformation of the 
entire real line onto itself, defined by T(x) = ax + B, where 
a and B are real numbers anda #0. If, for every subset E 
of X, T(E) denotes the set of all points of the form T(x) with 
x in E, i.e. T(E) = {ax + B: x © E}, then 


u*(T(E)) = |a|u*(Z) and px(T(E)) = | a |ue(E). 


The set T(E) is a Borel set or a Lebesgue measurable set if and 
only if Eis a Borel set or a Lebesgue measurable set, respectively. 


Proof. It is sufficient to prove the theorem for a >0. For, 
if a <0, then the transformation T is the result of the iteration 
of two transformations T; and J), T(x) = T,(T2(x)), where 
Ti(x) = | a|x + P and T2(x) = —x. We leave to the reader the 
verification of the fact that the transformation JT, sends Borel 
sets and Lebesgue measurable sets into Borel sets and Lebesgue 
measurable sets, respectively, and that it preserves the inner and 
outer measures of every set. 

Suppose then that a > 0, and let T(S) be the class of all sets 
of the form T(E) with E in S. It is clear that T(S) is a o-ring; 
we are to prove that 7(S) = S. If EF = [2,4) e P, then E = T(F), 
where 


so that E e T(S) and therefore S c T(S). By the same reasoning 
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E . x — 
applied to the transformation T~', T-!(x) = a » we may con- 


clude that S c T—(S), and, applying the transformation T to 
both sides of the last written relation, we obtain, T(S) C S and 
therefore T(S) = S. 

If, for every Borel set E, we write 


m(E) = n(T(E)) and pe(E) = an(Z), 


then both », and pe are measures on S. If E = (a,b) e P, then 
T(E) = [aa + B, ab + B), and 


m(E) = w(T(E)) = (ab + B) — (aa + B) = a(6 — a) = 
= an(E) = w(E), 


so that, by 8.E and 13.A, u(T(E)) = apu(£) for every E in S. 
Applying the results of the preceding two paragraphs to the 
transformation T~', we obtain the relations 


p*(T(E)) = inf {u(F): T(E) Cc Fe S} = 
= inf {au(T(F)): E c T"'(F) eS} = 
= ainf {u(G): EC GeS} = 
= au*(E), 
and, replacing inf by sup, u* by us, and C by > throughout, 


us (T(E)) = ous(E), 
for every set E. 
If E is a Lebesgue measurable set and is any set, then 


u*(4 0 T(E)) + w*(4 1 (T(E))’) = 

u*(T(T-'(4) N E)) + w*(T(T(4) 9 E’)) = 
ofu*(T—(4) N E) + w*(T-1(4) N EB) = 
ap*(T'(A)) = 

n*(A), 


so that T(Z) is Lebesgue measurable. This result applied to T~ 
proves its own converse and completes the proof of the theorem. J 


li 
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(1) The class of all Borel sets is the o-ring generated by the class C of all 
closed sets, and, for every set E, 


wx(E) = sup {u(C): ED CeC}. 


(2) To every Lebesgue measurable set E there correspond two Borel sets 
A and B such that 


ACECB, uw(B— A) =0, 


and such that 4 is an F, and B is a G3. 

(3) A bounded set has finite outer measure. Is the converse of this statement 
true? 

(4) Let {x1, x2, ---} be an enumeration of the set M of rational numbers in 
the closed unit interval X. For every € > 0 and i = 1, 2, ---, let F,(e) be the 


open interval of length Fi whose center is at x;, and write 
1 
FQ = Uni Fe, F=NiuF(5)- 


The following statements are true. 

(4a) There exists an € > 0 and a point x in X such that x e’ F(e). 

(4b) The set F(e) is open and u(F(e)) Se 

(4c) The set X — F(e) is nowhere dense. 

(4d) The set X — F is of the first category and therefore, since X is a com- 
plete metric space, F is uncountable. (Hence, in particular, F ~ M.) 

(4e) The measure of F is zero. 

Since F D M, the statement (4e) yields a new proof of the fact that the set 
M of rational numbers (as every countable set) has measure zero. More inter- 
esting than this, however, is the implied existence of an uncountable set of 
measure zero; cf. (5). 

(5) Expand every number « in the closed unit interval X in the ternary system, 
Le. write 


#2 D3. a, = 0, 1, 2, m=1,2,---, 


and let C be the set of all those numbers « in whose expansion the digit 1 is not 
needed. (Observe that if, motivated by the customary decimal notation, we 
write .aa2:++ for 39.1 an/3", then for instance } = .1000--+ = .0222---, and 
therefore } ¢ C, but since } = .111--- and since this is the only ternary expan- 
sion of 3, therefore 4 2’ C.) Let X1 be the open middle third of X, Xi = (4, 3); 
let X_ and X3 be the open middle thirds of the two closed intervals which make 
up X — X, te. X2 = (3, %) and X3 = (§, $); let X4, Xs, Xe, and X7 be the 
the open middle thirds of the four closed intervals which make up 


X— (xX, U XU X3), 


and so on ad infinitum. The following statements are true. 
(5a) C=X—U?.1 Xn. (Hint: for every x in X write x = .aia2:--, 
a, = 0,1,2,2 = 1,2, ---,in sucha way that ifxeC, thena, = Oor2,” = 1,2, 
Then the expansion of x is unique and (i) x eX, if and only if a, = 2, 
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(ii) ify 1, then x e X_ U Xz if and only if ag = 1; (iii) ifay ¥ 1 and a, ¥ 1, 
then «eX, U X;, U X, U_X; if and only if a3 = 1; +++.) 

(5b) 2(C) = 0. 

(5c) C is nowhere dense. (Hint: assume that X contains an open subinterval 
whose intersection with U7-1 Xn is empty.) 

(5d) Cis perfect. (Hint: no two of the intervals Xi, X2, -+- have a common 
point.) 

(Se) C has the cardinal number of the continuum. (Hint: consider the corre- 
spondence which associates with every x in C, x = .aj@---, an =O or 2, 
n = 1,2, ---, the number y whose dinary expansion is y = .BiB2--+, Bn = Qn/2, 
n=1, 2, -+-, or, equivalently, y = )°%.,;a,/2"t+1, This correspondence is 
not one to one between C and X, but it is one to one between the irrational 
numbers in C and the irrational numbers in X. Alternative hint: use (5d).) 

The set C is called the Cantor set. 

(6) Since the cardinal number of the class of all Borel sets is that of the 
continuum (cf. 5.9c), and since every subset of the Cantor set is Lebesgue 
measurable (cf. (5b)), there exists a Lebesgue measurable set which is not a 
Borel set. 

(7) The set of those points in the closed unit interval in whose binary expan- 
sion all the digits in the even places are 0 is a Lebesgue measurable set of measure 
zero. 

(8) Let X be the perimeter of a circle in the Euclidean plane. There exists a 
unique measure yu defined on the Borel sets of X such that u(X) = 1 and such 
that w is invariant under all rotations of X. (A subset of a circle is a Borel set 
if it belongs to the o-ring generated by the class of all open arcs.) 

(9) If g is a finite, increasing, and continuous function of a real variable, 
then there exists a unique complete measure ff, defined on a o-ring S, containing 
all Borel sets, such that #,([a2,b)) = g(4) — g(a) and such that for every E in 
s, there is a Borel set F with £,(E A F) = Q; (cf. 8.3). The measure ji, is called 
the Lebesgue-Stieltjes measure induced by g. 


§ 16. NON MEASURABLE SETS 


The discussion in the preceding section is not delicate enough 
to reveal the complete structure of Lebesgue measurable sets on 
the real line. It is, for instance, a non trivial task to decide 
whether or not any non measurable sets exist. It is the purpose 
of this section to answer this question, as well as some related 
ones. Some of the techniques used in obtaining the answer are 
very different from any we have hitherto employed. Since, how- 
ever, most of them have repeated applications in measure theory, 
usually in the construction of illuminating examples, we shall 
present them in considerable detail. Throughout this section we 
shall employ the same notation as in § 15. 
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If E is any subset of the real line and a is any real number, 
then E + a denotes the set of all numbers of the form x + a, 
with x in E; more generally if Z and F are both subsets of the real 
line, then E + F denotes the set of all numbers of the form « + y 
with x in E and y in F. The symbol D(£) will be used to denote 
the difference set of E, i.e. the set of all numbers of the form 
x — y with x in & and y in E. 

Theorem A. If E is a Lebesgue measurable set of positive, 


finite measure, and if0 S a < 1, then there exists an open inter- 
val U such that p(E N VU) = ap(V). 


Proof. Let U be the class of all open sets. Since, by 15.C, 
g(E) = inf {u(U): E c U eV}, we can find an open set Up such 
that E c Up and au(Up) S a(E). If {U,} is the disjoint sequence 
of open intervals whose union is Up, then it follows that 


a yonci p(U,) s pas R(E nN U,)- 
Consequently we must have au(U,) S$ a(E N U,) for at least 
one value of ; the interval U, may be chosen for U. 


Theorem B. Jf E is a Lebesgue measurable set of positive 
measure, then there exists an open interval containing the 
origin and entirely contained in the difference set D(E). 


Proof. If E is, or at least contains, an open interval, the result 
is trivial. In the general case we make use of Theorem A, which 
asserts essentially that a suitable subset of E is arbitrarily close 
to an interval, to find a bounded open interval U such that 


A(E N U) = u(U). 
If —4yu(U) < x < $u(U), then the set 
(EN U)U (EN U) +x) 


is contained in an interval (namely U U (U + x)) whose length 
is less than $u(U). If EMU and (EN U) + <x were disjoint, 


then, since they have the same measure, we should have 


A(E NU) U (EN U) + «)) = 2(E NU) 2 gu(U). 
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Hence at least one point of E N U belongs also to (E N U) + x, 
which proves that x e D(E). In other words the interval (—$u(U), 
3u(U)) satisfies the conditions stated in the theorem. Jf 


Theorem C. Jf & is an irrational number, then the set A 
of all numbers of the form n + mé, where n and m are arbitrary 
integers, is everywhere dense on the line; the same is true of the 
subset B of all numbers of the form n + mé with n even, and the 
subset C of numbers of the form n + mé with n odd. 


Proof. For every positive integer i there exists a unique 
integer 2; (which may be positive, negative, or zero) such that 
0 <n; + if < 1; we write x; = ; + if. If U is any open inter- 


; cee 1 
val, then there is a positive integer k such that u(U) > z Among 
the k + 1 numbers, *, ---, *.41, in the unit interval, there must 
tbe at least two, say x; and x;, such that | x; — x,;| < 7 It 


follows that some integral multiple of x; — x;, i.e. some element 
of 4, belongs to the interval U, and this concludes the proof of 
the assertion concerning 4. The proof for B is similar; we have 
merely to replace the unit interval by the interval [0,2). The 
proof for C follows from the fact thatC = B+1. J 


Theorem D. There exists at least one set Ey which is not 
Lebesgue measurable. 


Proof. For any two real numbers x and y we write (for the 
purposes of this proof only) « ~y if x —yeA, (where 4 is 
the set described in Theorem C). It is easy to verify that the 
relation “~’” is reflexive, symmetric, and transitive, and that, 
accordingly, the set of all real numbers is the union of a disjoint 
class of sets, each set consisting of all those numbers which are 
in the relation “~” with a given number. By the axiom of choice 
we may find a set Ey containing exactly one point from each such 
set; we shall prove that Ey is not measurable. 

Suppose that F is a Borel set such that F Cc Eo. Since the 
difference set D(F) cannot contain any non zero elements of the 
dense set 4, it follows from Theorem B that F must have meas- 
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ure zero, so that ws(Zo) = 0. In other words, if Ey is Lebesgue 
measurable, then its measure must be zero. 

Observe next that if a, and ag are two different elements of 
A, then the sets Ey + a, and Ey + ae are disjoint. (If x, + a, = 
Xq + a, with x, in Ey and x2 in Eo, then «1 — *2 = ae — a, © A.) 
Since moreover the countable class of sets of the form Ey + a, 
where a e 4, covers the entire real line, ic. Eo + 4 = X, and 
since the Lebesgue measurability of Hy would imply that each 
Ey + @ is Lebesgue measurable and of the same measure as Ep, 
we see that the Lebesgue measurability of Ey) would imply the 
nonsensical result p(X) = 0. Jf 

The construction in the proof of Theorem D is well known, but 
it is not strong enough to yield certain counter examples needed 
for later purposes. The following theorem is an improvement. 


Theorem E. There exists a subset M of the real line such 
that, for every Lebesgue measurable set E, 


ux(M NE) =0 and u*(M NE) = a(B). 


Proof. Write 4 = BUC, as in Theorem C, and, if Ep is 
the set constructed in the proof of Theorem D, write 


M=E£,+ B. 


If F is a Borel set such that F c M, then the difference set D(F) 
cannot contain any elements of the dense set C, and it follows 
from Theorem B that ux(M) = 0. The relations 


M=Ey+C=Ey+(B+1)=M+4+1 


imply that ps(M’) = 0; (cf. 15.D). If E is any Lebesgue meas- 
urable set, then the monotone character of yx implies that 
pa( M0 E) = ps(M’ N E) = 0, and therefore (14.H) »*(7 N £) 
= f(E). J 

The proofs of this section imply among other things that it is 
impossible to extend Lebesgue measure to the class of all subsets 
of the real line so that the extended set function is still a measure 
and is invariant under translations. 
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(1) If E is a Lebesgue measurable set such that, for every number « in an 
everywhere dense set, 
MEA(E + x)) = 0, 


then either @(E) = 0 or else @(E’) = 0. 

(2) Let » be a o—finite measure on a o-ring S of subsets of a set X, and let 
u* and ws be the outer measure and the inner measure, respectively, induced 
by # on H(S). Let M be any set in H(S), and let S be the o-ring generated 
by the class of all sets in S together with M@. The chain of assertions below is 
designed to lead up to a proof of the assertion that u may be extended to a 
measure ji on 8S. 

(2a) The o-ring § is the class of all sets of the form (EN M)A(FN M’, 
where E and F are in S. (Hint: it is sufficient to prove that the class of all sets 
of the indicated form is a o-ring. Observe that 


(EN MA(FN M’) = (EN M)U (FN M’).) 


(2b) If 4*(M) < 0, if G and H are a measurable kernel and a measurable 
cover of M respectively, and if D = H — G, then the intersection of any set in 
S with D’ belongs to S. 

(2c) There exist two sets G and HinS such thatG C M C Hand ps (M — G) 
= pe(H — M) = 0, and such that if D = H — G, then the intersection of any 
set in S with D’ belongs to S. (Hint: there exists a disjoint sequence {Xn} 
of sets in S with n(X,) < © and M = U%_.1(MN X,).) 

(2d) In the notation of (2c), #s(M M D) = ws(M’ N D) = 0, and therefore 
u*(M 1) D) = u*(M’ ND) = u(D). 

(2e) In the notation of (2c), if 


(AN MAANAMIND=[(BNMARN MND, 
where E;, Fi, E2, and F are in S, then 
mE, 1 D) = (E21 D) and w(Fi MN D) = (Fe N D). 
(Hint: use the fact that the condition 
(A AE) NMN DIA[(Fi AR) NM’ ND) =0 
implies that 
(AN DACA ND)CM' ND and (ANDAR ND) CMND,) 


(2f) Let a and B be non negative real numbers witha +8 = 1. In the nota- 
tion of (2c) the set function g on §, defined by 


A(EN M)A(FN M’) = 
=u((EN M) 4 (FNM) ND’) + on(Z ND) + BuF ND), 


is a measure on § which is an extension of 4 on S. 

(3) If » is a o-finite measure on a o-ring S and if {M,, ---, Mn} is a finite 
class of sets in the hereditary o-ring H(S), then {M, ---, M,} may be adjoined 
to S and a measure fi may be defined on the generated o-ring § so that it is an 
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extension of 4on S. The analogous statement, for an infinite sequence {Mn} 
of sets in H(S), is not known 

(4) The following example is useful for developing intuition about non 
measurable sets; virtually all general properties of non measurable sets may be 
illustrated by it. Let X = {(x,y):0 S «3 1,0S y S 1} be the unit square. 
For every subset E of the interval [0,1], write 


E = {(xy):xeE, OSy SIC. 


Let S be the class of all sets of the form £, for Lebesgue measurable sets E; 
define u(£) as the Lebesgue measure of E. A set M such as M = {(x,y): 
0 SxS 1,¥y = 4} is non measurable; u4(M) = 0 and u*(M) = 1, 

(5) Let u* be a regular outer measure on the class of all subsets of a set X 
such that #*(X) = 1, and let M be a subset of X such that uy(M) = 0 and 
u*(M) = 1; (cf. Theorem E and (4)). If v*(Z) = p*(E) + *(E N M), then 
v* is an outer measure; (cf. 10.5 and 10.7). 

(5a) A set E is v*-measurable if and only if it is u*-measurable; (cf. 12.6). 

(5b) The infimum of the values of »*(E) over all »*-measurable sets E con- 
taining a given set 4 is 2u*(4). (Hint: if E is y*-measurable, then n*(E M M) 
= 2*(Z).) 

(5c) The outer measure v* is not regular. (Hint: test regularity with M’.) 


Chapter IV 


MEASURABLE FUNCTIONS 


§ 17. MEASURE SPACES 


A measurable space is a set X and a o-ring S of subsets of X 
with the property that UJ S = X. Ordinarily it causes no con- 
fusion to denote a measurable space by the same symbol as the 
underlying set X; on the occasions when it is desirable to call 
attention to the particular o-ring under consideration, we shall 
write (X,S) for X. It is customary to call a subset E of X meas- 
urable if and only if it belongs to the o-ring S. This terminology 
is not meant to indicate that S is the o-ring of all »*—measurable 
sets with respect to some outer measure u*, nor even that a non 
trivial measure is or may be defined on S. 

In the language of measurable sets, the condition in the defini- 
tion of measurable spaces may be expressed by saying that the 
union of all measurable sets is the entire space, or, equivalently, 
that every point is contained in some measurable set. The purpose 
of this restriction is to eliminate certain obvious and not at all 
useful pathological considerations, by excluding from the space 
points (and sets of points) of no measure theoretic relevance. 

A measure space is a measurable space (X,S) and a measure 
p on $3; just as for measurable spaces we shall ordinarily allow 
ourselves to confuse a measure space whose underlying set is X 
with the set X. On the occasions when it is desirable to call atten- 
tion to the particular o-ring and measure under consideration, we 
shall write (X,S,u) for X. The measure space X is called [totally] 


finite, o-finite, or complete, according as the measure yu is [totally] 
73 
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finite, o—finite, or complete. For measure spaces we may and shall 
make use, without any further explanation, of the outer measure 
u* and (in the o-finite case) the inner measure ws» induced by yu 
on the hereditary o-ring H(S). 

Most of the considerations of the preceding chapter show by 
deductions and examples how certain measurable spaces may be 
made into measure spaces. In this section we shall make a few 
general remarks on measurable spaces and measure spaces and 
then, in the remainder of this chapter and in the following chap- 
ters, turn to the discussion of functions on measure spaces, useful 
ways of making new measure spaces out of old ones, and the 
theory of some particularly important special cases. 

We observe first that a measurable subset Xp of a measure 
space (X,S,u) may itself be considered as a measure space 
(Xo0,So,#0), where Sp is the class of all measurable subsets of Xo, 
and, for E in So, wo(Z) = w(E). Conversely, if a subset Xo of a 
set X is a measure space (X0,Sp,uo), then X may be made into a 
measure space (X,S,u), where S is the class of all those subsets 
of X whose intersection with Xo is in Sp, and, for E in S, p(Z) = 
po(E N Xo). (Entirely similar remarks are valid, of course, for 
measurable spaces.) A modification of this last construction is 
frequently useful even if X is already a measure space. If Xp isa 
measurable subset of X, a new measure po may be defined on the 
class of all measurable subsets E of X by the equation yo(Z) = 
p(E 1M Xp); it is easy to verify that (X,S,uo) is indeed a measure 
space. 

What happens to the considerations of the preceding paragraph 
if the subset Xo is not measurable? In order to give the most 
useful answer to this question, we introduce a new concept. A 
subset Xo of a measure space (X,S,u) is thick if ux(Z — Xo) = 0 
for every measurable set E. If X itself is measurable, then Xo 
is thick if and only if wex(X — Xo) = 0; if u is totally finite, then 
Xo is thick if and only if w*(Xo) = u(X). (For examples of thick 
sets cf. 16.E and 16.4.) Slightly deeper than any of the comments 
in the preceding paragraph is the following result, which asserts 
essentially that a thick subset of a measure space may itself be 
regarded as a measure space. 
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Theorem A. If Xo is a thick subset of a measure space 
(X,S,n), ff So=SN Xo, and if, for E in S, w(E lM Xo) = 
u(E), then (Xo,So,mo) is @ measure space. 


Proof. If two sets, E; and Eo, in S are such that E, X= 
E, nN Xo, then (Fi A E2) nN Xo = 0, so that u(E, A E2) = 0 and 
therefore u(E,) = w(E2). In other words po is indeed unam- 
biguously defined on Sp. 

Suppose next that {F,,} is a disjoint sequence of sets in So, 
and let E, be a set in S such that 


F, = E,N X, n= 1,2, --:. 
If FE, = E, — U{E:l Si <n},n=1,2, -+-, then 
(E,AE,) NX =(Fi- Ul{Fsl si<n})aAFk, = 
= F, AF, = 0, 
so that u(E, A E,) = 0, and therefore 
Dora Ho(Fn) = Doner M(En) = Dore w(E) = w(U2-1 £,) = 
WMUner En) = wo(Un-1 F,). 


In other words pg is indeed a measure, and the proof of the theorem 
is complete. J 


(1) The following converse of Theorem A is true. If (X,S,u) is a measure 
space and if Xo is a subset of X such that, for every two measurable sets E; 
and Eo, the condition E; 1 Xo = Ee M Xo implies that w(E)) = u(E2), then 
Xois thick. (Hint: if FC E — Xo, then 


(E-F)NX=EN Xo) 


(2) The extension theorem 16.2 may be used to give an alternative proof 
of Theorem A in the o-finite case. 

(3) The following proposition shows that the concept of a finite measure space 
is not very different from the apparently much more special concept of a totally 
finite measure space. If (X,S,u) is a finite measure space, then there exists a 
thick measurable set Xo. (Hint: write c = sup {u(E): EeS}. Let {£,} be a 
sequence of measurable sets such that lima u(En) = ¢ and write Xo = Use: En. 
Observe that u(Xo) = ¢.) This result enables us, in most applications, to assume 
that a finite measure space is totally finite, since we may replace X by Xo with- 
out significant loss of generality. For an example of a finite measure space 
which is not totally finite, let X be the real line, let S be the class of all sets of 
the form E U C, where E is a Lebesgue measurable subset of [0,1] and C is 
countable, and let 4 on S be Lebesgue measure. The method suggested above 
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to show the existence of Xo has frequent application in measure theory; it is 
called the method of exhaustion. 

(4) If (X,S,z) is a complete, o-finite measure space, then every u*-measurable 
set is measurable. Hence for complete, o—finite measure spaces the two con- 
cepts of measurability collapse into one. 


§ 18. MEASURABLE FUNCTIONS 


Suppose that f is a real valued function on a set X and let M 
be any subset of the real line. We shall write 


f(M) = {x: f(x) eM}, 


i.e. f—'(M) is the set of all those points of X which are mapped 
into M by f. The set f—'(M) is called the inverse image (under 
f, or with respect to f) of the set M. If, for instance, f is the 
characteristic function of a set E in X, then f—1({1}) = E and 
f-({0}) = E’; more generally 


f7(M) =0, E, E'’, or X, 


according as M contains neither 0 nor 1, 1 but not 0, 0 but not 1, 
or both 0 and 1. 
It is easy to verify that, for every /, 
7 Wei M,) = rat (Mz), 
[OAL — Ny =f") =f WN); 

in other words the mapping f~', from subsets of the line to subsets 
of X, preserves all set operations. It follows in particular that if 
E is a class of subsets of the line (such as a ring or a o-ring) with 
certain algebraic properties, then f—'(E) (= the class of all those 
subsets of X which have the form f—'(M) for some M in E) is a 
class with the same algebraic properties. Of particular interest 
for later applications is the case in which E is the class of all 
Borel sets on the line. 

Suppose now that in addition to the set X we are given also a 
o-ring S of subsets of X so that (X,S) is a measurable space. 
For every real valued (and also for every extended real valued) 
function f on X, we shall write 


N(f) = {x f(x) # 0}; 
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if a real valued function / is such that, for every Borel subset M 
of the real line the set N(f) MN f7'(M) is measurable, then / is 
called a measurable function. 

Several comments are called for in connection with this defini- 
tion of measurability. First of all, the special role played by the 
value 0 should be emphasized. The reason for singling out 0 
lies in the fact that it is the identity element of the additive group 
of real numbers. In the next chapter we shall introduce the con- 
cept of integral, defined for certain measurable functions; the 
fact that integration (which is without doubt the most important 
concept in measure theory) may be viewed as generalized addition 
necessitates treating 0 differently from other real numbers. 

If f is a measurable function on X and if we take for M the 
entire real line, then it follows that N(/) is a measurable set. 
Hence if E is a measurable subset of X and if M is a Borel subset 
of the real line, then it follows from the identity 


EN {f"(M)} = 
=[EN NS) Nf" (M)) U [(E - NY) NFO), 


that E N f-1(M) is measurable. (Observe that the second term 
in the last written union is either empty or else equal to 
E — N(f).) If, in other words, we say that a real valued func- 
tion f defined on a measurable set £ is to be called measurable on 
E whenever E M f—!(M) is measurable for every Borel set M, 
then we have proved that a measurable function is measurable 
on every measurable set. If, in particular, the entire space X 
happens to be measurable, then the requirement of measurability 
on f is simply that f~'(M4) be measurable for every Borel subset 
M of the real line. In other words, in case X is measurable, a 
measurable function is one whose inverse maps the sets of one 
prescribed o-ring (namely the Borel sets on the line) into the sets 
of another prescribed o-ring (namely S). 

It is clear that the concept of measurability for a function 
depends on the o-ring S and therefore, on the rare occasions when 
we shall have more than one o-ring under consideration at the 
same time, we shall say that a function is measurable with respect 
to S, or, more concisely, that it is measurable (S). If in particular 
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X is the real line, and S and § are the class of Borel sets and the 
class of Lebesgue measurable sets respectively, then we shall call 
a function measurable with respect to S a Borel measurable 
function, and a function measurable with respect to § a Lebesgue 
measurable function. 

It is important to emphasize also that the concept of measur- 
ability for functions, just as the concept of measurability for sets, 
as used in §17, does not depend on the numerical values of a 
prescribed measure y, but merely on the prescribed o-ring S. 
A set or a function is, from this point of view, declared measurable 
by fiat; the concept is purely set theoretical and is quite inde- 
pendent of measure theory. 

The situation is analogous to that in the modern theory of 
topological spaces, where certain sets are declared open and cer- 
tain functions continuous, without reference to a numerical 
distance. The existence or non existence of a metric, in terms of 
which openness and continuity can be defined, is an interesting 
but usually quite irrelevant question. The analogy is deeper 
than it seems: the reader familiar with the theory of continuous 
functions on topological spaces will recall that a function / is 
continuous if and only if, for every open set M in the range 
(in our case the real line), the set f—'(M) belongs to the prescribed 
family of sets which are called open in the domain. 

We shall need the concept of measurability for extended real 
functions also. We define this concept simply by making the 
convention that the one-point sets {0} and { —o} of the extended 
real line are to be regarded as Borel sets, and then repeating 
verbatim the definition for real valued functions. Accordingly 
a possibly infinite valued function f is measurable, if, for every 
Borel set M of real numbers, each of the three sets 


f({o}), F7*-%}), and Nf) NSM) 


is measurable. We observe that for the extended concept of 
Borel set it is no longer true that the class of Borel sets is the o-ring 
generated by semiclosed intervals. 

We shall study and attempt to make clear the structure of 
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measurable functions in great detail below. The following is a 
preliminary result of considerable use. 


Theorem A. 4 real function f on a measurable space (X,S) 


is measurable if and only if, for every real number c, the set 
N(f) A {x2 f(x) < c} is measurable. 


Proof. If M is the open ray extending from ¢ to — on the 
real line, ic. M = {t:¢ <c}, then M is a Borel set andf—1(M) = 
{x: f(x) <c}. It is clear therefore that the stated condition is 
indeed necessary for the measurability of f. 

Suppose next that the condition is satisfied. If c,; and cp are 
real numbers, ¢; S ¢2, then 


{xs f(x) < co} — {xs f(x) <a} = {era S f(x) < ce}. 


In other words if M is any semiclosed interval, then N(f) N 
f-'(M) is the difference of two measurable sets and is therefore 
measurable. Let E be the class of all those subsets M of the 
extended real line for which N(f) N f~(M) is measurable. Since 
E is ao-ring, and since a o-ring containing all semiclosed intervals 
contains also all Borel sets, the proof of the theorem is complete. J 


(1) Theorem A remains true if < is replaced by S or > or 2. (Hint: if 
—%0 <¢ <0, then 


{x:f(x) Sc} = (Pea {a/e) oe +-| ) 


(2) Theorem A remains true if ¢ is restricted to belong to an everywhere 
dense set of real numbers. 

(3) If fis a measurable function and c is a real number, then cf is measurable. 

(4) If a set EZ is a measurable set, then its characteristic function is a measur- 
able function. Is the converse of this statement true? 

(5) A non zero constant function is measurable if and only if X is measurable. 

(6) If X is the real line and / is an increasing function, then / is Borel measur- 
able. Is every continuous function Borel measurable? 

(7) Let X be the real line and let Z be a set which is not Lebesgue measurable; 
write /(x) = * or —x according asxeE or xe’ E. Is f Lebesgue measurable? 

(8) If f is measurable, then, for every real number c, the set N(f) N 
{x: f(x) = c} is measurable. Is the converse of this statement true? 

(9) A complex valued function is called measurable if both its real and 
imaginary parts are measurable. A complex valued function / is measurable 
if and only if, for every open set M in the complex plane, the set N(f) N f-"(M) 
is measurable. 
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(10) Suppose that f is a real valued function on a measurable space (X,S), 
and, for every real number #, write B(t) = {x: f(x) St}. Then 


(10a) 5<¢ implies B(s) C BQ), 
(10b) U: B® =X and [): BY) =0, 
(10c) Ns< BY) = BGs). 


Conversely, if { B(¢)} is a class of sets with the properties (10a), (10b), and (10c), 
then there exists a unique, finite, real valued function f such that {x: f(x) S ¢} 
= B(t). (Hint: write f(x) = inf {#: x e B(t)}.) 

(11) If f is a measurable function on a totally finite measure space (X,S,p} 
and if, for every Borel set M on the extended real line, we write »(M) = 
u(f—1(M)), then v is a measure on the class of all Borel sets. If/ is finite valued, 
then the function g of a real variable, defined by g(t) = n({x: f(x) < #}), is 
monotone increasing, continuous on the left, and such that g(—%) = 0 and 
g(©) = p(X); g is called the distribution function of f. If g is continuous, then 
the Lebesgue-Stieltjes measure yz, induced by g according to the procedure of 
15.9, is the completion of ». If fis the characteristic function of a measurable 


set E, then v(M) = xau(1)u(E) + xm(O)u(E’). 


§ 19. COMBINATIONS OF MEASURABLE FUNCTIONS 


Theorem A. If f and g are extended real valued measurable 
functions on a measurable space (X,S), and if c is any real 
number, then each of the three sets 


A = {xi f(x) < g(x) + ¢}, 
B= {x: f(x) S g(x) + ¢}, 
C = {x: f(x) = g(x) +c}, 
has a measurable intersection with every measurable set. 


Proof. Let M be the set of rational numbers on the line. 
Since 


A= Ureml{x: f(x) <r} nN farr—e < g(x)}1], 


it follows that 4 has the desired property. The conclusions for 
B and C are consequences of the relations 


B=X — {x: g(x) < f(x) —c} and C=B-—A 


respectively. J 
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Theorem B. [Jf ¢ is an extended real valued Borel measur- 
able function on the extended real line such that (0) = 0, 
and if f is an extended real valued measurable function on a 
measurable space X, then the function f, defined by f(x) = 
o(f(*)), is @ measurable function on X. 


Proof. It is convenient to use here the definition of measur- 
ability (instead of the necessary and sufficient condition of § 18). 
If M is any Borel set on the extended real line, then 


Nf) NAAM) = {x: ox) © M — {0}} = 


= {x:f(«) ed "(M — {0})}. 
Since ¢(0) = 0, we have 
$M — {0}) = 914 — {0}) — {0}. 


Since ¢ is Borel measurable, ¢~1(M — {0}) is a Borel set and the 
measurability of the set 


Nf) NF-M) = Nf) AF*(G"(M — {0})) 


follows from the measurability of f. Jj 

Since it is easy to verify that, for any positive real number a, 
the function ¢, defined for every real number ¢ by $(¢) = | ¢|*, 
is Borel measurable, it follows that the measurability of a func- 
tion f implies the measurability of | f|*. Similarly any positive 
integral power of a measurable function is again a measurable 
function, and it follows similarly, by an even simpler argument, 
that a constant (real) multiple of a measurable function is also 
measurable. By considering Borel measurable functions ¢ of 
two or more real variables a similar argument may be used to 
prove such statements as that the sum and product of two measur- 
able functions are measurable. Since, however, we have not yet 
defined and proved any properties of Borel measurability for 
functions of several variables, we postpone these considerations 
and turn now to a direct proof of the measurability of sums and 
products. 


Theorem C. If f and g are extended real valued measurable 
functions on a measurable space X, then so also are f+ 


and fg. 
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Proof. Since the behavior of f + g and fg at those points x 
at which at least one of the two numbers, f(x) and g(x), is infinite 
is easily understood, after the examination of a small number of 
cases, we restrict our attention to finite valued functions. (We 
recall incidentally that if f(x) = + and g(x) = Fo, then 
f(x) + g(x) is not defined.) 


Since if f and g are finite and if ¢ is a real number, then 
{xi f(x) + g(x) <c} = [xs f(x) <c - g(x}, 


the measurability of f+ g follows from Theorem A (with —g 
in place of g). The measurability of fg is a consequence of the 


identity 
fe=U¢+?-U-»s)). I 


Since if f and g are finite we have 
fUg= 4 terlf—z) 
fNg=4¢+e-lf-e), 


Theorems B and C show that the measurability of f and g implies 
that of f Ug andf fg. If for every extended real valued func- 
tion f we write 


ft =fU0 and f- = -({N0), 
f=ft—f- and [fl =f*+f-. 


(The functions f* and f~ are called the positive part and the 
negative part of /, respectively.) The comment at the beginning 
of this paragraph implies that the positive and negative parts of 
a measurable function are both measurable; conversely, a func- 
tion with measurable positive and negative parts is itself measur- 


able. 


and 


then 


(1) If fis such that | | is measurable, does f have to be measurable? 

(2) If X is measurable, then Theorem B is true even without the assumption 
that (0) = 0; in other words, in this case a Borel measurable function of a 
measurable function is a measurable function. 
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(3) It is not true, even if X is measurable, that a Lebesgue measurable func- 
tion of a measurable function is a measurable function. The purpose of the 
sequence of statements below is to indicate the proof of this negative statement 
by the construction of a suitable example. The construction will yield a Lebesgue 
measurable function ¢ of a real variable y, and a continuous and strictly increas- 
ing function f of a real variable x, 0 S$ x S 1, such that if f(x) = (/(x)), then 
J is not Lebesgue measurable. 

For every x in X (where X = [0,1] is the closed unit interval), write 


x = Dt10;/3! = aos: +, 


wherea; = 0,1, or2,# = 1,2,---,sothatifxeC,thena; = Oor2,i = 1,2,---. 
(The set C is the Cantor set, defined in 15.5.) Let » = n(x) be the first index 
for which a, = 1. (If there is no such 4, i.e. if xe C, write (x) = ©.) Define 
the function ¥ by the equation 


¥(x) = Distcnas/2*4t +5 


(The function y is sometimes called the Cantor function.) 
(3a) IfO SxS y S1, then 


0 = ¥(0) S (x) S Hy) S WL) = 1. 


(Hint: if « = .ayaoas:-- Sy = .8i8o83---, and if a; = B; for 1 Si <j, then 
a; & B;.) 

: (3b) The function ¥i is continuous. (Hint: if x = .ayasas---, y = -BiBo83---, 
and a; = f; for 1 Si <j, then 


[¥e) -¥9) 1S ser) 


(3c) For every « in X there is one and only one number y, 0 S y S 1, such 
that x = 3(y + ¥(y)), and therefore the equation y = /(«) defines a strictly in- 
creasing, continuous function f on X. (Hint: 3(y + ¥(y)) is strictly increasing 
and continuous.) 

(3d) The set f —!(C) is Lebesgue measurable and has positive measure. (Hint: 


the set 
WX — C) = {Y(y): ye X — C} 
is countable and therefore has measure zero; consequently 
BS -MUX — C)) = $.) 


(3e) There exists a Lebesgue measurable set M, MC {y:0 Sy S$ 1}, such 
that f-1(M) is not Lebesgue measurable. (Hint: by 16.E, f—(C) contains a 
non measurable set. Recall that every subset of a set of Lebesgue measure zero 
is Lebesgue measurable.) 

_ (3f) If ¢ is the characteristic function of the set M mentioned in (3e) and if 
Hx) = 6U/()), then @ is Lebesgue measurable but / is not. 

(4) The set M in (3e) is an example of a Lebesgue measurable set which is 

not a Borel set; (cf. 15.6). 
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§20. SEQUENCES OF MEASURABLE FUNCTIONS 


Theorem A. [f {f,} is a sequence of extended real valued, 
measurable functions on a measurable space X, then each of the 
four functions h, g, f*, and fx, defined by 


A(x) = sup {fr(x): 7 = 1,2, ---f, 
g(x) = inf {f,(x*): 2 = 1,2, ---}, 
f*(«) = lim sup fa(*), 
f(x) = lim inf, fa(*); 
is measurable. 


Proof. It is easy to reduce the general case to the case of 
finite valued functions. The equation 


{x: g(x) <c} = Ura {er fae) <c} 


implies the measurability of g. The result for 4 follows from the 
relation 
A(x) = — inf { —fa(x): n=1,2,-- -}, 


The measurability of {* and fy is a consequence of the relations 


F*(x) = infret SUPm2nJm(*), Sx (%) = SUPrz1 infmz nfm(X), 


respectively. §f 
It follows from Theorem A that the set of points of convergence 
of a sequence {/,} of measurable functions, i.e. the set 


{x: lim supa fn(x) = lim inf, fn(x)}, 


has a measurable intersection with every measurable set, and, 
consequently, that the function f, defined by f(«) = lim, f,(*) 
at every x for which the limit exists, is a measurable function. 

A very useful concept in the theory of measurable functions is 
that of a simple function. A function f, defined on a measurable 
space X, is called simple if there is a finite, disjoint class {£,, ---, 
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E,} of measurable sets and a finite set {a1, ---, an} of real num- 
bers such that 


a; if «ek, i=1,---,2, 


fe) = {5 fel Ue 


(We emphasize the fact that the values of a simple function are 
to be finite real numbers: this will be essential in the sequel.) 
In other words a simple function takes on only a finite number 
of values different from zero, each on a measurable set. 

The simplest example of a simple function is the characteristic 
function xg of a measurable set E. It is easy to verify that a 
simple function is always measurable; in fact we have, for the 
simple function f described above, 


f(x) = Doles oixe,(x). 


The product of two simple functions, and any finite linear com- 
bination of simple functions, are again simple functions. 


Theorem B. Every extended real valued measurable function 
f is the limit of a sequence {f,} of simple functions; if f is non 
negative, then each f, may be taken non negative and the sequence 
{fn} may be assumed increasing. 


Proof. Suppose first that f20. For every m = 1, 2, ---, 
and for every « in X, we write 
-_ 1 a 
1 if i-1 
Sr(x) =} 2” 2” 
n if f(x) Zn. 


S f(x) <5 i=1, a0 tg 25M, 


Clearly fn is a non negative simple function, and the sequence 
{fa} is increasing. If f(x) < ©, then, for some 7, 
1 
0 Sf) —f/(*) S am 


if f(x) = ©, then f,(x) = m for every x. This proves the second 
half of the theorem; the first half follows (recalling that the 
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difference of two simple functions is a simple function) by apply- 
ing the result just proved separately toft andf-. § 


(1) All the concepts and results of this section and the preceding one (except, 
of course, the ones depending on such order properties of the real numbers as 
positiveness) can be extended to complex valued functions. 

(2) If the function f in Theorem B is bounded, then the sequence {f,} may 
be made to converge tof uniformly. 

(3) An elementary function is defined in the same way as a simple function, 
the only change being that the number of sets Z;, and therefore the number of 
corresponding values a@;, is allowed to be countably infinite. Every real valued 
measurable function f is the limit of a uniformly convergent sequence of ele- 
mentary functions. 


§21. PoINTWISE CONVERGENCE 


In the preceding three sections we have developed the theory 
of measurable functions about as far as it is convenient to do so 
without mentioning measure. From now on we shall suppose 
that the underlying space X is a measure space (X,S,y). 

If a certain proposition concerning the points of a measure 
space is true for every point, with the exception at most of a 
set of points which form a measurable set of measure zero, it is 
customary to say that the proposition is true for almost every 
point, or that it is true almost everywhere. The phrase ‘“‘almost 
everywhere’’ is used so frequently that it is convenient to intro- 
duce the abbreviation a.e. Thus, for instance, we might say that 
a function is a constant a.e—meaning that there exists a real 
number ¢ such that {x: f(x) ¥ c} is a set of measure zero. A 
function f is called essentially bounded if it is bounded a.e., i.e. 
if there exists a positive, finite constant ¢ such that {x:| f(x) | > c} 
is a set of measure zero. The infimum of the values of ¢ for which 
this statement is true is called the essential supremum of | /|, 
abbreviated to 


ess. sup. | f |. 


Let {f,} be a sequence of extended real valued functions which 
converges a.e. on the measure space X to a limit function /f. 
This means, of course, that there exists a set Ey of measure zero 
(which may be empty) such that, if x ©’ Ky and e > 0, then an 
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integer % = mo(x,e) can be found with the property that 


Fn (x) <> -, if I(x) tO, 
[fa(x) —f(x)| <6 if -—2 <f(x) <a, 
fale) > 4, if fa) =, 


whenever » 2. We shall say that a sequence {/,} of real 
valued functions is fundamental a.e. if there exists a set Eo of 
measure zero such that, if x e’Ey and e > 0, then an integer 
Mo = No(x,e) can be found with the property that 


| fn(x) — fim(x) | <«, whenever 22m) and m2 mn. 


Similarly in the theory of real sequences one distinguishes between 
a sequence {a,} of extended real numbers which converges to an 
extended real number a, and a sequence {a,,} of finite real numbers 
which is a fundamental sequence, i.e. which satisfies Cauchy’s 
necessary and sufficient condition for convergence to a finite 
limit. 

It is clear that if a sequence converges to a finite valued limit 
function a.e., then it is fundamental a.e., and, conversely, that 
corresponding to a sequence which is fundamental a.e. there 
always exists a finite valued limit function to which it converges 
a.e. If moreover the sequence converges a.e. to f and also con- 
verges a.e. to g, then f(x) = g(x) a.e., i.e. the limit function is 
uniquely determined to within a set of measure zero. 

We shall have occasion in the sequel to refer to several differ- 
ent kinds of convergence, and we shall consistently employ 
terminology similar to that of the preceding paragraphs. Thus, 
if we define a new kind of convergence of a sequence {f,} to a 
limit f, by specifying the sense in which f, is to be near tof for 
large », then we shall use without any further explanation the 
notion of a sequence which is fundamental in this new sense— 
meaning that, for large 7 and m, the differences fn — fm are to 
be near to 0 in the specified sense of nearness. 

An example of another kind of convergence for sequences of 
real valued functions is uniform convergence a.e. The sequence 
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{ f,} converges tof uniformly a.e. if there exists a set Ey of meas- 
ure zero such that, for every ¢ > 0, an integer m = mo(e) can be 
found with the property that 


[fn(x) —f(x)|<e, if nm2m and xe’ Ey, 


in other words if the sequence of functions converges uniformly 
to f (in the ordinary sense of that phrase) on the set X — Ep. 
Once more it is true, and easily verified, that a sequence con- 
verges uniformly a.e. to some limit function if and only if it is 
uniformly fundamental a.e. 

The following result (known as Egoroff’s theorem) establishes 
an interesting and useful connection between convergence a.e. 
and uniform convergence. 


Theorem A. Jf E is a measurable set of finite measure, 
and if {fn} is @ sequence of a.e. finite valued measurable func- 
tions which converges a.e. on E to a finite valued measurable 
function f, then, for every € > 0, there exists a measurable 
subset F of E such that p(F) < ¢ and such that the sequence 
{fa} converges to f uniformly on E — F. 

Proof. By omitting, if necessary, a set of measure zero from 
E, we may assume that the sequence {f,} converges to f every- 


where on E. If 
1 
Ext = Nin fe LAs) — sf) < <1, 


then 
E,\™ < Eo™ C---, 


and, since the sequence { f,} converges to f on E, 
lim, £," D E 


for every m = 1, 2, ---. Hence lim, u(E — E,”) = 0, so that 
there exists a positive integer 7 = mo(m) such that 


€ 
ME — Engin”) < on 
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(To be sure 7 depends also on ¢, but € remains fixed throughout 
the entire proof.) If 


Fe= Un-: (E a Engm”)s 
then F is a measurable set, F c E, and 


WP) = wh U mer (E — Eng”) S Dimer ME — Enon”) < e. 


Since E — F = EN (Yinns Engen”, and since, therefore, for 7 = 
mo(m) and for x in E — F, we have xe E,”, it follows that 


| fa(x) — f(x) | < < , which proves uniform convergence on 
E-F. j 

Motivated by Egoroff’s theorem we introduce the concept of 
almost uniform convergence. A sequence {f,} of a.e. finite 
valued measurable functions will be said to converge to the meas- 
urable function f almost uniformly if, for every e > 0, there 
exists a measurable set F such that u(F) < e and such that the 
sequence {/,} converges to f uniformly on F’. In this language 
Egoroff’s theorem asserts that on a set of finite measure con- 
vergence a.e. implies almost uniform convergence. The following 
result goes in the converse direction. 


Theorem B. Jf {f,} is a sequence of measurable functions 
which converges to f almost uniformly, then { f,} converges to f 
a.e. 


Proof. Let F, be a measurable set such that u(F,) <* 
and such that the sequence {/,} converges to f uniformly on 
F,/,n = 1,2,---. If F = (\poi Fn, then 

1 
u(F) Ss B(F x) < a 


so that u(F) = 0, and it is clear that, for x in F’, {fn(«)} con- 
verges tof(x). fj 

We remark that the phrase “almost uniform convergence” 
is a somewhat confusing (but unfortunately standard) misnomer, 
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which conflicts with the ‘‘almost everywhere” terminology. Some 
such phrase as “‘nearly uniform convergence” might come closer 
to suggesting the true state of affairs; as it stands, some care 
has to be exercised to distinguish between almost uniform con- 
vergence and almost everywhere uniform convergence. 


(1) If f is any real valued, Lebesgue measurable function on the real line, 
then there exists a Borel measurable function g such that f(x) = g(x) ace. 
(Hint: write E, = {x: f(x) <r} for every rational number r, and use 13.B to 
express E, in the form F, A N,, where F; is a Borel set and N,; has measure zero. 
Let N be a Borel set of measure zero containing LU, N; and define g by 


eee i if xeN, 
BT lx) if, xe’ N. 
Cf. 18.2.) 

(2) If E is a measurable set of positive finite measure, and if { f,} is a sequence 
of a.e. finite valued measurable functions which is fundamental a.e., then there 
exists a positive finite constant ¢ and a measurable subset F of E of positive 
measure such that, for every = 1, 2, --- and for every x in F, | fa(x) | Sc 

(3) If E is a measurable set of o-finite measure, and if {fn} is a sequence of 
a.e. finite valued measurable functions which converges a.e. on E to a finite 
valued measurable function f, then there exists a sequence {£;} of measurable 
sets such that u(E — U7. E,) = 0 and such that the sequence {f,} converges 
uniformly on each £;, i = 1, 2, ---. (Hint: it is sufficient to prove the result 
if u(E) <0. In this case apply Egoroff’s theorem to find £; so that 


1 
ME — UP. BE) < 


and so that {f,} converges uniformly on E;.) 

(4) Let X be the set of positive integers, let S be the class of all subsets of 
X, and, for E in S, let u(E) be the number of points in E. If x, is the character- 
istic function of the set {1, ---, }, then the sequence {xn} converges to 1 
everywhere but it is not almost uniformly fundamental. In other words, 
Egoroff’s theorem is not true if £ is not of finite measure. 

(5) For every essentially bounded function f, write || /|| = ess. sup. [f|. 
If {fn} is a sequence of essentially bounded measurable functions, then the 
sequence { f,} converges tof uniformly a.e. if and only if lima || fn —/ || = 0. 

(6) Is the set 9% of all essentially bounded measurable functions a Banach 
space with respect to the norm described in (5)? 


§22. CONVERGENCE IN MEASURE 


In this section, as in the preceding one, we shall work through. 
out with a fixed measure space (X,S,x). 
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Theorem A. Suppose that f andf,,n = 1,2, ---, are real 
valued measurable functions on a set E of finite measure, and 
write, for every « > 0, 


E,(e) = {x:|fa(x) —f(%) | 2, 2=1,2,---. 
The sequence {fn} converges to f a.e. on E if and only if 
lima ME A nan En(e)) = 0 
for every € > 0. 


Proof. It follows from the definition of convergence that the 
sequence { f,(x)} of real numbers fails to converge to the real 
number f(x) if and only if there is a positive number e such that 
x belongs to E,(e) for an infinite number of values of 7. In 
other words, if D is the set of those points x at which { f,(x)} does 
not converge to f(x), then 


1 
DS Uclmease b= Vein sipee: (;): 


Consequently a necessary and sufficient condition that p(E N D) 
= 0 (i.e. that the sequence { f,} converge tof a.e. on £) is that 
u(E 2 lim sup, E,(e)) = 0 for every « > 0. The desired conclu- 
sion follows from the relations 


w(E 1 lim sup, En(e)) = w(E N ()R-1 Uren En(e)) = 
=lim, (EN Un-, Enle)). O 


The desire to investigate the result of an obvious weakening 
of the condition of Theorem A motivates the definition of still 
another method of convergence which has frequent application. 
A sequence {f,} of a.e. finite valued, measurable functions con- 
verges in measure to the measurable function / if, for every 
e > 0, lim, w({x: | fa(x) — f(x) | 2 ef) = 0. In accordance with 
our general comment on different kinds of convergence in the 
preceding section, we shall say that a sequence {f,} of a.e. 
finite valued measurable functions is fundamental in measure 
if, for every € > 0, 


u({x: |falx) —fn(x) | 2 e}) 20 as mand m > © 
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It follows trivially from Theorem A that if a sequence of 
finite valued measurable functions converges a.e. to a finite 
limit [or is fundamental a.e.] on a set E of finite measure, then it 
converges in measure [or is fundamental in measure] on E. The 
following theorem is a slight strengthening of this assertion in 
that it makes no assumptions of finiteness. 


Theorem B. Almost uniform convergence implies conver- 
gence in measure. 


Proof. If {f/,} converges to f almost uniformly, then, for 
any two positive numbers e and 6, there exists a measurable set 
F such that »(F) < 6 and such that | f,(~) — f(x) | < « whenever 
x belongs to F’ and is sufficiently large. J 


Theorem C. Jf {fn} converges in measure to f, then {fn} 
is fundamental in measure. If also { f,} converges in measure 
to g, then f = g ae. 


Proof. The first assertion of the theorem follows from the 
relation 


fx: | fa(x) —fm(x) | 2 ef 
< {x1 A@) — 400) & 4} u fer Leate) — 0) | 2 SI. 


To prove the second assertion, we observe that, similarly, 


{x: | f(x) — g(x) |Z dic 
cfs: [falx) — fe) | 2 ‘| U [¥: |fa(x) — g(x) | = <|. 


Since, by proper choice of 1, the measure of both sets on the 
right can be made arbitrarily small, we have 


u({x: | f(x) — g(x) | 2 ef) = 0 


for every € > 0; this implies, as asserted, thatf=gae. J 

In addition to these comparatively elementary remarks, we 
shall present two slightly deeper properties of convergence in 
measure. 
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Theorem D. Jf { f,} is a sequence of measurable functions 
which is fundamental in measure, then some subsequence { fry} 
ts almost uniformly fundamental. 


Proof. For any positive integer k we may find an integer 7(k) 
such that if 1 2 7(k) and m = 7i(k), then 


u({a: Lae) —Jale) 12 }) <p 
We write 


m = 71), m2 = (m +1) U A(2), mg = (m2 +1) U AGB), «°°; 


then ™ < mz <3 <---, so that the sequence {f,,} is indeed an 
infinite subsequence of {f,}. If 


Ey = {»: | foal) — foa#) | 2 7| 


and k Si Sj, then, for every x which does not belong to E;, U 
FExyi U Exye U---, we have 


| fo(*) — Silx) | s poe lFom(*) — = Fazal) | rer 5 ’ 


so that, in other words, the sequence {f,,} is uniformly funda- 
mental on X — (Ey U Ex4, U---). Since 


1 
u(y U Ey: U-++) S Dinar w(Em) < Ra? 
the proof of Theorem D is complete. §f 


Theorem E. Jf {fn} is a sequence of measurable functions 
which is fundamental in measure, then there exists a measurable 
function f such that { f,} converges in measure to f. 


Proof. By Theorem D we can find a subsequence { f,,} which 
is almost uniformly fundamental and therefore fundamental a.e.; 
we write f(x) = lim: f,,(*) for every x for which the limit exists. 
We observe that, for every e > 0, 


(ell) —f) 12 Je 
{ez [fale) — fale) | & S} U fx: Leute) 10 | 2 3}. 
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The measure of the first term on the right is by hypothesis arbi- 
trarily small if 7 and m, are sufficiently large, and the measure of 
the second term also approaches 0 (as k — ~), since almost 
uniform convergence implies convergence in measure. J 


(1) Suppose that the measure space (X,S,u) is totally finite, and let { fn} 
and {gn} be sequences of finite valued measurable functions converging in meas- 
ure to f and g respectively. 

(1a) If a and B are real constants, then {af, -+ ga} converges in measure to 
of + Bg; {| fn |} converges in measure to | /|. 

(1b) Iff = Oa.e., then {f,7} converges in measure tof”. 

(Ic) The sequence {fag} converges in measure to fg. (Hint: given a positive 
number 6, find a constant ¢ such that if E = {x: | g(x) | Sc}, then u(X — E) 
< 6, and ‘consider the situation separately on E and X — E.) 

(a) | he sequence {f,?} converges in measure to f2, (Hint: apply (1b) to 
fa — 

(le) Le sequence { fagn} converges in measure to fg. (Hint: apply the 
identity which expresses a product in terms of sums and squares.) 

(1f) Are the statements (la)-(le) valid for measure spaces which are not 
totally finite? 

(2) Every subsequence of a sequence which is fundamental in measure is 
fundamental in measure. 

(3) If {fn} is a sequence of measurable functions which is fundamental in 
measure, and if {f,,} and {fm;} are subsequences which converge a.e. to the 
limit functions f and g respectively, then f = g a.e. 

(4) If X is the set of positive integers, S is the class of all subsets of X, and, 
for every E in S, u(£) is the number of points in E, then, for the measure space 
(X,S,u), convergence in measure is equivalent to uniform convergence every- 
where. 

(5) Is it necessarily true on a set of infinite measure that convergence a.e. 
implies convergence in measure? (Cf. 21.4 and (4).) 

(6) Let the measure space X be the closed unit interval with Lebesgue meas- 
ure. If, form = 1,2, ---, 

i-1 i 


Ei = [ n ,<|, t=1,---,%, 


and if xn‘ is the characteristic function of En’, then the sequence {x1', x2, x2?, 
xs!, xs”, xa°, ---} converges in measure to 0, but fails to converge at any point 


(7) Let {En} be a sequence of measurable sets and let xn be the character- 
istic function of En, = 1,2, -+-. The sequence {xn} is fundamental in measure 
if and only if p(E,,En) — Oas and m — . (For the definition of p see 9.4.) 


Chapter V 


INTEGRATION 


§23. INTEGRABLE SIMPLE FUNCTIONS 


A simple function f = )o?.1 asxg, on a measure space (X,S,u) 
is integrable if u(E;) < © for every index i for which a; ¥ 0. 
The integral of /, in symbols 


frerducs) or fifdu 
is defined by fi du = ., a(E,). It follows easily from the ad- 


ditivity of u that if f is also equal to 071 B xr, then f fap = 


D7. Bye(F;), ice. that the value of the integral is independent 
of the representation of f and is therefore unambiguously defined. 
We observe that the absolute value of an integrable simple func- 
tion, a finite, constant multiple of an integrable simple function, 
and the sum of two integrable simple functions are integrable 
simple functions. 

If E is a measurable set and f is an integrable simple function, 
then it is easy to see that the function xz/f is an integrable simple 
function also; we define the integral of f over E by 


fsa = f xefdu. 


The simplest example of an integrable simple function is the 
characteristic function of a measurable set E of finite measure; 


we have | xgdu = J du = p(Z£). 
95 
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In the sequel we shall define the notions of integrability and 
integral on a wider domain than the class of integrable simple 
functions. Some useful definitions and the statements of several 
important results (but very few proofs) depend only on such 
elementary properties of integration as we have already explicitly 
mentioned. In order to avoid unnecessary duplication, we shall 
therefore proceed as follows. Throughout this section we shall 
use the word “function” as an abbreviation for “‘simple function.” 
As a consequence of this policy all our definitions and theorems 
will make sense not only for simple functions but also for the 
wider class we shall subsequently consider. The proofs in this 
section will, however, apply to simple functions only; we shall 
complete the proofs, so that they will apply to the more general 
case also, a little later. 

The proofs of Theorem A and B below are omitted; these 
results are immediate consequences of the definitions and, in the 
case of Theorem A, an obvious and simple computation. 


Theorem A. If f and g are integrable functions and a and 
B are real numbers, then 


fs + Bg)du = af fdu + Bf edu. 
Theorem B. Jf an integrable function f is non negative 
a.e., then | fd 2 0. 


Theorem C. If f and g are integrable functions such that 


f 2 g 4@.e., then 
feu & feau. 


Proof. Apply Theorem B tof — gin place off. §j 
Theorem D. If f and g are integrable functions, then 


fis +eldu s fi slaw + fle lau. 


Proof. Apply Theorem C to |f| +] g| and |/+ g| in place 
of f and g, respectively. 
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Theorem E. If f is an integrable function, then 


Ifa | s [if du. 


Proof. Apply Theorem C first to | f| and f and then to |f| 
and —f. Jj 


Theorem F. If f is an integrable function, a and B are real 
numbers, and E is a measurable set such that, for x in E, 
a S f(x) S B, then 


ou(E) S f fdu S Bul). 


Proof. Since the principal assumption is equivalent to the 
relation axe S xzf S Bxz, the desired result follows from 
Theorem C if w(EZ) < 0; the case in which u(E) = & is easily 
treated by direct application of the definition of integrability. J 

The indefinite integral of an integrable function f is the set 


function v, defined for every measurable set E by »(E) = f Sd. 


Theorem G. If an integrable function f is non negative a.e., 
then its indefinite integral is monotone. 


Proof. If E and F are measurable sets such that Ec F, 
then xef S xr/fa.e., and the desired result follows from Theorem 
Cc. 

A finite valued set function v defined on the class of all measur- 
able sets of a measure space (X,S,u) is absolutely continuous if 
for every positive number e¢ there exists a positive number 6 such 
that | »(E) | < ¢ for every measurable set E for which u(E) < 6. 


Theorem H. The indefinite integral of an integrable function 
ts absolutely continuous. 


Proof. If ¢ is any positive number greater than all the values 
of | f|, then, for every measurable set E, we have 


[fa <c(E). I 
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Theorem I. The indefinite integral of an integrable func- 
tion 1s countably additive. 


Proof. If f is the characteristic function of a measurable set 
E of finite measure, then the assertion of countable additivity for 
the indefinite integral of f is just a restatement of the countable 
additivity of 4 on measurable subsets of E. The assertion of the 
theorem for arbitrary integrable simple functions is a consequence 
of the fact that every such function is a finite linear combination 
of characteristic functions. § 

If f and g are integrable functions, we define the distance, 
e(f,g), between them by the equation 


(fa) = [LF - g Iau. 


The function p deserves the name “‘distance” in every respect but 
one. It is true and trivial that 


e(f,f) = 90, p(fig) = o(g,f), and p(fg) S o(g,4) + (A, f). 


It is not true, however, that if p(fg) = 0, then f = g. The dis. 
tance between two integrable functions can, for instance, vanish 
if they are equal almost everywhere (but not necessarily every- 
where). In a subsequent section we shall study this phenomenon 
in some detail. 


(1) If one of two simple functions is integrable, then so is their product. 

(2) If E and F are measurable sets of finite measure, then p(xz,xr) = 
u(E AF). Cf. 9.4 and 22.7. 

(3) Let (X,S,u) be the closed unit interval with Lebesgue measure, and, for 
some fixed point xo in X, write »(E) = xz(xo). Is the set function » absolutely 
continuous? 

(4) If» is an absolutely continuous set function on the class of all measurable 
sets of a measure space (X,S,u), then y(E) = 0 for every measurable set E for 
which u(E) = 0. 

(5) If a totally finite measure space X consists of a finite number of points, 
then every real valued measurable function on X is an integrable simple func- 
tion, and the theory of integration specializes to the theory of finite sums. 


§24,. SEQUENCES OF INTEGRABLE SIMPLE FUNCTIONS 


We shall continue in this section to work with a fixed measure 
space (X,S,u), and to use the device of abbreviating “simple 
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function” to “function.” Since all the methods of this section 
(with only one minor exception, occurring at the end of the proof 
of Theorem D) are based on the general results of the preceding 
section, it will turn out that not only the statements but even the 
proofs of the following theorems will remain unaltered when we 


turn to the general case. 
A sequence { f,} of integrable functions is fundamental in the 
mean, or mean fundamental, if 


p(fafm) 20 as n and m — o, 


Theorem A. 4 mean fundamental sequence { fn} of integra- 
ble functions is fundamental in measure. 


Proof. If, for any fixed positive number e, 
Enm = {x:| fal) — fm(x) | = ef, 


then 


PCS ns fm) = [ifn sane |du zi lf — Sm |dp 2 eu(Enm), 


so that u(Enm) ~ Oasmandm oo, § 


Theorem B. Jf {f,} is a mean fundamental sequence of 
integrable functions, and if the indefinite integral of fn 15 Vny 
n=1,2,---, then 

v(E) = lim, vn(£) 
exists for every measurable set E, and the set function v 1s finite 
valued and countably additive. 


Proof. Since | va(E) — ym(E) | S f \fe — fu [du —> Oas and 


m — , the existence, finiteness, and uniformity of the limit are 
clear, and it follows from the finite additivity of limits that » is 
finitely additive. If {£,} is a disjoint sequence of measurable sets 
whose union is E, then we have, for every pair of positive integers 


nandk 
| (E) — Dk. »(E)| Ss 
S | »(E) — va(E) | + | on(Z) — Deir vn(E,) | 
+ | va(Uin E;) - (Ui E)) |. 
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The first and third terms of the right side of this inequality may 
be made arbitrarily small by choosing 7 sufficiently large, and, 
for fixed ”, the middle term may be made arbitrarily small by 
choosing & sufficiently large. This proves that 


v(E) = lim, Dini (E)) = Di (E). | 


If {»,} is a sequence of finite valued set functions defined for 
all measurable sets, we say that the terms of the sequence are 
uniformly absolutely continuous whenever for every positive num- 
ber ¢ there exists a positive number 5 such that | »,(E) | < e for 
every measurable set E for which p(E) < 4, and for every positive 
integer 7. 

Theorem C. Jf {fn} is a mean fundamental sequence of 
integrable functions, and if the indefinite integral of fn is Vny 
n= 1,2, ---, then the set functions v, are uniformly absolutely 
continuous. 


Proof. If « > 0, let mp be a positive integer such that, for 
n Zn and m2 nm, 


Slits —fnldu <5, 
and let 5 be a positive number such that 
a ee, n=, +++, mM 
E es 


for every measurable set E for which u(E) < 4; (cf. 23.H). If 
E is a measurable set for which u(E) < 6 and if 7 S mp, then 


|».(E) | < df fn Idu <6 


if, on the other hand, ” > mo, then 


| va() |S flfa — foo ldn + fifo ld <e 


Since the following theorem is of no particular importance in 
the general case, we shall restrict its statement and proof to the 
case of simple functions only. 
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Theorem D. Jf {fn} and {g,} are mean fundamental 
sequences of integrable simple functions which converge in meas- 
ure to the same measurable function f, if the indefinite integrals 
of fn and Zn are Vy and d, respectively, and if, for every measur- 
able set E, 


vV(E) = limnva(E) and dE) = lim, ),(E£), 
then the set functions v and ) are identical. 
Proof. Since, for every e« > 0, 
En = {x: | fale) — ga(x) | 2 ef 


cfelaw —sol 2s} u fete — les], 


it follows that lim, u(Z,) = 0. Hence, if E is a measurable set 
of finite measure, then in the relation 


fis — [du = — & [4p +f | |au +f gn [du 


the first term on the right is dominated by eu(Z), and the last 
two terms can be made arbitrarily small by choosing ” sufficiently 
large, because of the uniform absolute continuity proved in 
Theorem C. It follows that 


lim, | vn(E) — An(E) | = 0, 


and hence that »(E) = X(E). Since v and d are both countably 
additive, it follows that »(Z) = X(£) for every measurable set 
E of o-finite measure. 

Since the f, and g, are simple functions, each of them is defined 
in terms of a finite class of measurable sets of finite measure. 
If Ey is the union of all sets in all these finite classes, then Ep is 
a measurable set of o-finite measure, and we have, for every 
measurable set E, 


rni(E = Eo) = An(E _- Eo) =0 


and therefore »(E — Ey) = X(E — Ey) = 0. Since this implies 
that »(Z) = (ENE) and XE) = NEN £), the proof of 
Theorem D is complete. J 
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(1) Is the set of all integrable simple functions a complete metric space with 
respect to the distance p? 

(2) In the notation of Theorem B, if {E,} is a disjoint sequence of measurable 
sets, then the series }-4-1 (En) converges absolutely. (Hint: the series con- 
verges unconditionally.) 


§ 25. INTEGRABLE FUNCTIONS 


An a.e. finite valued, measurable function f on a measure 
space (X,S,u) is integrable if there exists a mean fundamental 
sequence {/,} of integrable simple functions which converges in 
measure tof. The integral of /, in symbols 


ff (x)du(x) or f Sap 


is defined by i. Jadu = lim, f frdu. It follows from 24.D (with E 


= U,N(f,)) that the value of the integral of f is uniquely de- 
termined by any particular such sequence. We emphasize the 
fact that the value of the integral is always finite. We observe 
that it follows from the known and obvious properties of mean 
convergence and convergence in measure that the absolute value 
of an integrable function, a finite constant multiple of an in- 
tegrable function, and the sum of two integrable functions are 
integrable functions. The relations 


ft =3f|+/) and f= 30/1 -S) 


show also that if f is integrable, then ft and f~ are integrable. 

If E is a measurable set and if {f,} is a mean fundamental 
sequence of integrable simple functions converging in measure 
to the integrable function /, then it is easy to see that the sequence 
{xefn} is mean fundamental and converges in measure to xz/. 
We define the integral of f over E by 


J pau = frosdu 


We recall that the theorems of §§ 23 and 24 were stated for 
general integrable functions but were proved for integrable simple 
functions only. We are now in a position to complete their 
proofs. 
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The results 23.A and 23.B follow immediately from elementary 
properties of limits; 23.C-23.G follow from 23.B verbatim as 
before. 

To prove the absolute continuity of an indefinite integral, 
23.H, let {f,} be a mean fundamental sequence of integrable 
simple functions which converges in measure to the integrable 
function f. We have 


If fad Af fad + Af fat - f saul, 


for every measurable set EF. Since the /, are simple functions, 
the theorem 24.C on uniform absolute continuity may be applied 
to prove that the first term on the right becomes arbitrarily 
small if the measure of E is taken sufficiently small. The second 
term on the right approaches 0 as x — ©, by the definition of 


f Jap; this completes the proof of 23.H. 


The proof of the countable additivity of an indefinite integral 
is even simpler. Indeed, using the notation of the preceding 
paragraph, the fact that the f, are simple functions justifies the 
application of 24.B, which then yields exactly the assertion of 
23.1, 

The proofs of 24.A—24.C were based on the statements, and 
not on the proofs, of the results of § 23, and are therefore valid in 
the general case. This remark completes the proofs of all the 
theorems of the preceding two sections. 

We shall say that a sequence {f,} of integrable functions 
converges in the mean, or mean converges, to an integrable 
function if 


pfasf) = [lf —fldu 0 as mo, 


Our first result concerning this concept is extremely similar, in 
statement and in proof, to 24.A. 


Theorem A. Jf {fn} is a sequence of integrable functions 
which converges in the mean to f, then { fn} converges to f in 
measure. 
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Proof. If, for any fixed positive number e, 


E, = {x:|fa(x) —f(x) | 2 et, 


then 


lie Flu =f lfa —S du 2 u(Es), 


so that w(E,) > 0Oasn > wo § 
Theorem B. If f is an a.e. non negative integrable function, 
then a necessary and sufficient condition that f fdu = 0 is that 
f=O0 ae. 


Proof. If f = 0 a.e., then the sequence each of whose terms 
is identically zero is a mean fundamental sequence of integrable 
simple functions which converges in measure to /, and it follows 


that f fdu = 0. To prove the converse, we observe that if {/n} 


is a mean fundamental sequence of integrable simple functions 
which converges in measure to f, then we may assume that 
fn = 0, since we may replace each f, by its absolute value. The 


assumption f Jadu =0 implies that lim, fi frdu = 0, ie. that 


{fn} mean converges to 0. It follows from Theorem A that 
{fn} converges to 0 in measure and hence the desired result is 


implied by 22.C. J 
Theorem C. If f is an integrable function and E is a set of 


measure zero, then 
f jes. 
E 


Proof. Since f fap = f xzfdu, and since the characteristic 
E 


function of a set of measure zero vanishes a.e., the desired result 
follows from Theorem B. J 


Theorem D. If f is an integrable function which is positive 
a.e. on a measurable set E, and if f Jfdu = 0, then w(E) = 0. 
E 
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Proof. We write Fy = {x:f(x) > 0} and F, = {fl 2 | - 


n = 1, 2, ---; since the assumption of positiveness implies that 
E — Fy is a set of measure zero, we have merely to prove that 
Ef Fo is one also. Since 


1 
o=f fdu = —~w(EN F,) = 0, 
EQFa n 


and since Fy = Ure, Fa, the desired result follows from the 
relation (EN Fo) S Drei w(ENF,). | 


Theorem E. Jf f is an integrable function such that 
f fdu = 0 for every measurable set F, then f = 0 a.e. 
F 


Proof. If E = {x: f(x) > O}, then, by hypothesis, f fd = 0, 
z 


and therefore, by Theorem D, EF is a set of measure zero. Applying 
the same reasoning to —f shows that {x: f(x) < 0} is a set of 
measure zero. ff 


Theorem F. Jf f is an integrable function, then the set 
N(f) = {x:/(*) ¥ 0} has o-finite measure. 


Proof. Let { f,} be a mean fundamental sequence of integrable 
simple functions which converges in measure to f. For every 
n=1,2, ---, N(/) is a measurable set of finite measure. If 
E = N(f) — Us-1 NU), and if F is any measurable subset of 
E, then it follows from the relation 


JS fau = limns f fadu = 0 


and Theorem E that f = 0 ae. on E. In view of the definition 
of N(f) this implies that u(Z) = 0; we have 
Nf) co Uns NG) UE | 


It is frequently useful to define the symbol f Jfdp» for certain 


non integrable functions f. If, for instance, f is an extended real 
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valued, measurable function such that f 2 0 ae. and if f is not 
integrable, then we write 
ffen = «. 


The most general class of functions f for which it is convenient 
to define f Jd is the class of all those extended real valued measur- 


able functions f for which at least one of the two functions f+ 
and f~ is integrable; in that case we write 


Sfeu = frrau — fpr 


Since at most one of the two numbers, f Std and f frdp, is 


infinite, the value of f fdu is always +0, —o, or a finite real 


number—it is never the indeterminate form oo — o, We shall 
make free use of this extended notion of integration, but we shall 
continue to apply the adjective “integrable” to such functions 
only as are integrable in the sense of our former definitions. 


(1) If X is the space of positive integers (described, for instance, in 22.4), 
then a function / is integrable if and only if the series )>n=1 | f() | is convergent, 


and, if this condition is satisfied, then f fd = 2.1 f(n). 


(2) Iff is a non negative integrable function, then its indefinite integral is a 
finite measure on the class of all measurable sets. 
(3) If fis integrable, then, for every positive number e, 


u(fx: [fle | 2 gh) < @. 


(4) If gis a finite, increasing, and continuous function of a real variable, and 
fi, is the Lebesgue-Stieltjes measure induced by g (cf. 15.9), and if f is a function 


which is integrable with respect to this measure, then the integral {/(*)dig(x) 

is called the Lebesgue-Stieltjes integral of f with respect to g and is denoted by 
+o 

f S()dg(x). If, in particular, g(x) = x, then we obtain the Lebesgue integral, 


denoted by f Aware. If f is a continuous function such that N(/) is a bounded 
set, then f is Lebesgue integrable. 
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§ 26. SEQUENCES OF INTEGRABLE FUNCTIONS 


Theorem A. If {fn} is a mean fundamental sequence of 
integrable simple functions which converges in measure to the 
integrable function f, then 


o(f.fn) =flf-t lau > 0 as no; 


hence, to every integrable function f and to every positive num- 
ber «, there corresponds an integrable simple function g such 
that p( fig) <e. 


Proof. For any fixed positive integer m, {| fn — fm |} is a mean 
fundamental sequence of integrable simple functions which con- 
verges in measure to | f — f, |, and, therefore, 


P\s Fo he = Tiss fi fn — fos Lee 
The fact that the sequence { f,} is mean fundamental implies the 
desired result. §f 


Theorem B. Jf {f,} is a mean fundamental sequence of 
integrable functions, then there exists an integrable function f 


such that p(faf) — 0 (and consequently fi trap > f fdu) as 


n> ©, 

Proof. By Theorem A, for each positive integer 7 there is an 
, P 1 
integrable simple function g, such that p(/,,2,) < o It follows 


that {g,} is a mean fundamental sequence of integrable simple 
functions; let f be a measurable (and therefore integrable) func- 
tion such that {g,} converges in measure to f. Since 


0.5 | [fadu — [fdu| 5 [fa —Sldu = Unf) 8 


s P(fnsSn) = p(gnsf), 
the desired result follows from Theorem A. J 
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In order to phrase our next result in a concise and intuitive 
fashion, we recall first the definition of a certain kind of con- 
tinuity for set functions. A finite valued set function » on a class 
E of sets is continuous from above at 0 (cf. § 9) if, for every de- 
creasing sequence {E,} of sets in E for which lim, E, = 0, we 
have lim, »(E,) = 0. If {vn} is a sequence of such finite valued 
set functions on E, we shall say that the terms of the sequence 
are equicontinuous from above at 0 if, for every decreasing se- 
quence {£,} of sets in E for which lim, E, = 0, and for every 
positive number e, there exists a positive integer mp such that if 
m = mo, then | »,(Em) | <e2 =1,2, °°. 


Theorem C. 4 seguence {fn} of integrable functions con- 
verges in the mean to the integrable function f if and only if 
{fn} converges in measure to f and the indefinite integrals of 
|fn|, 2 = 1, 2, «++, are uniformly absolutely continuous and 
equicontinuous from above at 0. 


Proof. We prove first the necessity of the conditions. Since 
convergence in measure and uniform absolute continuity follow 
from 25.A and 24.C respectively, we have only to prove the as- 
serted equicontinuity. 

The mean convergence of { f,} tof implies that to every positive 
number e¢ there corresponds a positive integer 7 such that if 


n = mo, then f |fn —S ld < :: Since the indefinite integral of 


a non negative integrable function is a finite measure (23.1), it 
follows from 9.E that such an indefinite integral is continuous 
from above at 0. Consequently, if {Z,,} is a decreasing sequence 
of measurable sets with an empty intersection, then there exists 
a positive integer my such that, for m 2 mo, 


filfla <5 and Jolt —Sldu <5, Wie Deas 


Hence, if m = mo, then we have 


Jilfelde se — flew +f If ltu <e 
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for every positive integer m, and this is exactly the desired equi- 
continuity result. 

We turn to the proof of sufficiency. Since a countable union of 
measurable sets of o—finite measure is a measurable set of o—finite 
measure, it follows from 25.F that 


Ey = Unear {x fn(x) ¥ 0} 


is such a set. If {E,} is an increasing sequence of measurable 
sets of finite measure such that lim, En = Eo, and if Fy, = 
Ey — En, n = 1,2, +--+, then {F,} is a decreasing sequence and 
lim, F, = 0. The assumed equicontinuity implies that, for 
every positive number 4, there exists a positive integer & such that 


7) 
f | fn |\du < =, and consequently 
Fy 2 


fit — fn \du s [lf \du +f lfe \du <6 


If for any fixed e > 0 we write 
Gan aaa {x: | fm (x) — fn(x) | P= e}, 


then it follows that 


fife —Foldu sf lim —Salu tf lfm —Suldu 


S eu(E;) + f natin — fa Id 


By convergence in measure and uniform absolute continuity, the 
second term on the dominant side of this chain of inequalities 
may be made arbitrarily small by choosing m and 7 sufficiently 
large, so that 


lim SUPrn [mee —Sn |du Ss eu (Ex). 


Since ¢ is arbitrary, it follows that 


linn f | fa — faldu = 0. 
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Since 


itm =o bd = ffm — So Id = 


=f lfm — fa ld +f | fm — fu ii 


we have 


lim supa, f he = Fildes 
and therefore, since 6 is arbitrary, 
litinn ffm — fa itu = 0. 


We have proved, in other words, that the sequence { f,} is funda- 
mental in the mean; it follows from Theorem B that there exists 
an integrable function g such that {/,} mean converges to g. 
Since mean convergence implies convergence in measure, we must 
havef=gae. Jf 

The following result is known as Lebesgue’s bounded con- 
vergence theorem. 


Theorem D. If {f,} is a sequence of integrable functions 
which converges in measure to f [or else converges to f a.e.], and 
if g is an integrable function such that | fr(x) | S | g(x) | ae. 
n= 1,2, ---, then f is integrable and the sequence {fn} con- 
verges to f in the mean. 


Proof. In the case of convergence in measure, the theorem is 
an immediate corollary of Theorem C—the uniformities required 
in that theorem are all consequences of the inequality 


fi faldus fl glen 
E E 


The case of convergence a.e. may be reduced to that of conver- 
gence in measure (even though the integrals are not necessarily 
over a set of finite measure, cf. 22.4 and 22.5) by making use of 
the existence of g. If we assume, as we may without any loss 
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of generality, that | f.(x)| S| g(«)| and | f(*)| S| g(x)| for 
every x in X, then we have, for every fixed positive number e, 


Ex = Usea {x2 |Als) - 70) | 2 J & [et] 900) 12 ‘|, 


and therefore u(E,) < ©,” = 1,2, ---. Since the assumption of 
convergence a.e. implies that u((\%-1 Zn) = 0, it follows from 
9.E that 


lim sup, w({x: | f(x) — f(x) | = e}) S lim, w(Z,) = 
ulin ES = 0: 


In other words, convergence a.e., together with being bounded 
by an integrable function, implies convergence in measure, and 
the proof of the theorem is complete. J 


(1) Is the set of all integrable functions a Banach space with respect to the 


norm defined by || f || = fs lau? 

(2) If {fn} is a uniformly fundamental sequence of functions, integrable over 
a measurable set E of finite measure, then the function f, defined by f(x) = 
limn fa(x), is integrable over E and f | fn —f\du > Oasn — o, 

(3) If the measure space (X,S,u) is finite, then Theorem C remains true even 
if the equicontinuity condition is omitted. 

(4) Let (X,S,u) be the space of positive integers (cf. 22.4). 

(4a) Write 


| ae 
fa = fr Laka; 
0 if k>n. 


The sequence { fa} may be used to show that the equicontinuity condition may 
not, in general, be omitted from Theorem C. 

(4b) The sequence described in (4a) may be used to show also that if {fn} 
is a uniformly convergent sequence of integrable functions whose limit function 


J is also integrable, then we do not necessarily have lim, Stadt = {fdus (cf. (2) 


above). 
(4c) Write 


1. 
alk) = z if 1sSkSn, 
0 if k>n 


The sequence { f,} may be used to show that the limit of a uniformly convergent 
sequence of integrable functions need not be integrable. 
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(5) Let X be the closed unit interval with Lebesgue measure and let {£,} 
be a decreasing sequence of open intervals such that u(En) = Ben 1,2, -+:. 


The sequence {xz,} may be used to show that the boundedness condition cannot 
be omitted from Theorem D. 

(6) If {fn} is a sequence of integrable functions which converges in the mean 
to the integrable function f, and if g is an essentially bounded measurable func- 
tion, then { f,g} mean converges to /g. 

(7) If {fn} is a sequence of non negative integrable functions which converges 


a.e. to an integrable function f, and if [fad = fren, n= 1, 2, ---, then {fn} 


converges to f in the mean. (Hint: write gn = fn — f and observe that the 
trivial inequality |f, —/| S/n +/ implies that 0 S$ xn- Sf. The bounded 
convergence theorem may therefore be applied to the sequence {g,~}; the de- 


sired result follows from the fact that f gn*du - f &, au =0,n = 1,2, ---.) 


§27. PROPERTIES OF INTEGRALS 


Theorem A. Jf f is measurable, g is integrable, and|f\| S 
| g| a.e., then f is integrable. 


Proof. Consideration of the positive and negative parts of f 
shows that it is sufficient to prove the theorem for non negative 
functions f. If f is a simple function, the result is clear. In the 
general case there is an increasing sequence { f,} of non negative 
simple functions such that lim, fn(x) = f(x) for all x in_X. Since 
0</fn <|g|, each f, is integrable and the desired result follows 


from the bounded convergence theorem. Jf 


Theorem B. Jf {f,} is an increasing sequence of extended 
real valued non negative measurable functions and if lima fn(x) 


= f(x) a.e., then lim, Pfu = [fdv. 
Proof. If f is integrable, then the result follows from the 


bounded convergence theorem and Theorem A. The only novel 
feature of the present theorem is its application to the not neces- 


sarily integrable case; we have to prove that if f dp = ~, then 


lima f frdp = ©, or, in other words, that if lim, f Fndu <0, then 
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J is integrable. From the finiteness of the limit we may conclude 
that 


Hitnn | [fmdu — [fads | = 0. 


Since fm — fn is of constant sign for each fixed m and , we have 


| [Fndis ~ [fade | = fi fm — fu id 


so that the sequence {f,} is mean convergent and therefore 
(26.B) mean converges to an integrable function g. Since mean 
convergence implies convergence in measure, and therefore a.e. 
convergence for some subsequence, we have f = ga.e. fj 


Theorem C. 4 measurable function is integrable if and only 
if its absolute value is integrable. 


Proof. The new part of this theorem is the assertion that the 
integrability of | f| implies that of f, and this follows from Theorem 
A with |/| in place of zg. §j 


Theorem D. Iff is integrable and g is an essentially bounded 
measurable function, then fg is integrable. 


Proof. If |g|<c ae., then |fg| Sc|f| a.e. and therefore 
the result follows from Theorem C. Jf 


Theorem E. I[f/ is an essentially bounded measurable func- 
tion and E is a measurable set of finite measure, then f is 
integrable over E. 


Proof. Since the characteristic function of a measurable set 
of finite measure is an integrable function, the result follows 
from Theorem D with xg and f in place of fandg. Jj 

Our next and final result is known as Fatou’s lemma. 


Theorem F. If { f,} is a sequence of non negative integrable 
Junctions for which 


lim inf, f fadu <%, 
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then the function f, defined by 
J (x) = lim inf, fn(*), 
is integrable and 


f. fdu lim inf, f Sail 


Proof. If g,(x) = inf {f,(x~): n Si < o}, then g, Sf, and 
the sequence { g,} is increasing. Since f gndu S f Find, it follows 
that 

lima f id= lim inks f fda ee: 


Since lim, ga(x) = lim inf, fr(x) = /(*), it follows from Theorem 
B that / is integrable and 


f. eae f a= nats f Fille § 


(1) If f is a measurable function, g is an integrable function, and a and B 
are real numbers such that a S f(x) S B a.e., then there exists a real number 7, 


assy 8, such that {f| ¢|du = v fleldu. (Hint: 
a figid s ffl eli sB- fl elds) 


This result is known as the mean value theorem for integrals. 
(2) If {f,} is a sequence of integrable functions such that 


ened fifa |du < %, 


then the series )-n=1/n(x) converges a.e. to an integrable function f and 


Sfau = Xen Spada. 


(Hint: apply Theorem B to the sequence of partial sums of the series 
Yine1|fn(x) | and recall that absolute convergence implies convergence.) 

(3) If f and fy are integrable functions, m = 1, 2, ---, such that | f(x) | S 
| A(x) | a.e., then the functions f* and fy, defined by 


S*(«) = lim supn fa(x) and fe (x) = lim inf, f..(x), 


are integrable and 


f ‘f*du = lim sup, f “fads = Tim inf, f. “fad 2 f gdp. 
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(Hint: by considering separately the positive and negative parts, reduce the 
general case to the case of non negative fn, and then apply Fatou’s lemma to 
(f + fa} and {f — fat.) 

(4) A measurable function f is integrable over a measurable set E of finite 
measure if and only if the series 


Dinar ME A {xz [ f(x) | 2 2}) 


converges. (Hint: use Abel’s method of partial summation.) What can be 
said if u(E) = ©, or if the summation is extended from ” = 0? 

(5) Suppose that {Z,} is a sequence of measurable sets and m is any fixed 
positive integer, and let G be the set of all those points which belong to Ey 
for at least m values of m. Then G is measurable and 


my(G) Ss Desai w(En). 
(Hint: consider )>*_, 5 Xz, (*)du(x).) 


(6) Suppose that f is a finite valued, measurable function on a totally finite 
measure space (X,S,u), and write 


am Dita dalek <r cH), 212, 
Then 


fteu = lima Sa, 


in the sense that if f is integrable, then each series s, is absolutely convergent, 
the limit exists, and is equal to the integral, and, conversely, if any one of the 
series 5, converges absolutely, then all others do, the limit exists, f is integrable, 
and the equality holds. (Hint: it is sufficient to prove the result for non nega- 
tive functions. Write 


say ~ <I STA, = 012- 
0 if f(x) =0, 


and apply Theorem B. For the converse direction observe that 
S(x) S 2fnlx) + w(X), 


so that f is integrable and therefore the preceding reasoning applies.) 

(7) The following considerations are at the basis of an alternative popular 
approach to integration. Let be a non negative integrable function on a meas- 
ure space (X,S,u). For every measurable set E we write 


a(E) = inf { f(x): xe E}, 


and for every finite, disjoint class C = {F1, ---, En} of measurable sets we write 


s(C) = Dot.1 a(E,)u(Z,). 
We assert that the supremum of all numbers of the form s(C) is equal to f Sap. 
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If f is a simple function, the result is clear. If gis a non negative simple function 
such that g Sf, say g = )P-1 axe, we write C = {Ei ---, En}. Then 


Sedu = Cte ram(E) S Dts a(EQu(Ed) = s(C). 


It follows that if {g,} is an increasing sequence of non negative simple functions 
converging tof, then 


lima f gndu S sup s(C), 
and therefore ffau S sup s(C), On the other hand, for every C, s(C) S fan, 


since s(C) is, in fact, the integral of a function such as g. 

(7a) Does the result of the preceding paragraph extend to non integrable, 
non negative functions? 

(7b) If f is an integrable function on a totally finite measure space (X,S,u), 
and if its distribution function g is continuous, (cf. 18.11), then 


Seu = fade) 


(cf. 25.4). (Hint: assume f 2 0, and make use of (7) above by considering the 
“approximating sums” s(C) of both integrals.) 


Chapter VI 


GENERAL SET FUNCTIONS 


§ 28. SIGNED MEASURES 


In this chapter we shall discuss a not too difficult but rather 
useful generalization of the notion of measure; the principal dif- 
ference between measures and the set functions we now propose 
to treat is that the latter are not required to be non negative. 

Suppose that mw, and ye are two measures on a o-ring S of sub- 
sets of aset X. If we define, for every set Ein S, n(E) = w,(Z) + 
u2(E), then it is clear that u is a measure, and this result, on the 
possibility of adding two measures, extends immediately to any 
finite sum. Another way of manufacturing new measures is to 
multiply a given measure by an arbitrary non negative constant. 
Combining these two methods, we see that if {41, -++, wat is a 
finite set of measures and {a, ---, an} is a finite set of non nega- 
tive real numbers, then the set function yu, defined for every set 
E in S by 

u(E) = Dita. ami(Z), 
is a measure. 

The situation is different if we allow negative coefficients. If, 
for instance, », and pe. are two measures on §S, and if we define 
uw by w(E) = m(E) — ue(E), then we face two new possibilities. 
The first of these, namely that 4 may be negative on some sets, 
is not only not a serious objection but in fact an interesting 
phenomenon worth investigating. The second possibility presents, 


however, a difficulty that has to be overcome before the investiga- 
117 
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tion can begin. It can, namely, happen that 4,(Z) = ue(E) = ~; 
what sense, in this case, can we make of the expression for u(E)? 

To avoid the difficulty of indeterminate forms, we shall agree 
to subtract two measures only if at least one of them is finite. 
This convention is analogous to the one we adopted in presenting 


the most general definition of the symbol f fdu. (We recall that 


fdp is defined for a measurable function / if and only if at least 


one of the two functions f+ and f~ is integrable, i.e. if and only 
if at least one of the two set functions vt and »~, defined by 


y+(E) = Jf ftdy and »-(E) = I frau, 


is a finite measure.) The analogy can be carried further: if f is 


a measurable function such that f fd is defined, then the set func- 


tion », defined by »(E) = f Jdp, is the difference of two measures. 
E 


The definition that we want to make is sufficiently motivated 
by the preceding paragraphs. We define a signed measure as 
an extended real valued, countably additive set function » on the 
class of all measurable sets of a measurable space (X,S), such that 
#(0) = 0, and such that » assumes at most one of the values 
+o and —o, 

We observe that implicit in the requirement of countable addi- 
tivity is the requirement that if {EZ,} is a disjoint sequence of 
measurable sets, then the series )\n=1 u(En) is either convergent 
or definitely divergent (to ++ or —o)—in any case that the 
symbol S>7_, »(E,) makes sense. 

The words “{totally] finite” and “[totally] o-finite” will be 
used for signed measures just as for measures, except that u(Z) 
has to be replaced by | u(E) |, or, equivalently, u(E) < « has to 
be replaced by —© < u(E) < ~. For instance, a signed measure 
u is totally finite if X is measurable and | p(X) | < @. 

One of our objectives in the following study is to prove that 
every signed measure is the difference of two measures. If this 
result is granted, it follows that we could have defined the concept 
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of signed measure on a ring and then attempted to copy the ex- 
tension procedure for measures; and it follows equally that it 
would have been a waste of time to do so, since we may, instead, 
reduce the discussion of signed measures to that of measures. 

It follows from the definition of signed measures, just as for 
measures, that a signed measure is finitely additive and, there- 
fore, subtractive. 


Theorem A. If E and F are measurable sets and wis @ signed 
measure such that 


ECF and |ulF)| <~, 


then | h(E) | < &. 


Proof. We have u(F) = n(F — E)+ u(Z). If exactly one 
of the summands is infinite, then so is u(F); if they are both 
infinite, then (since 4 assumes at most one of the values ++ and 
—o) they are equal and again u(F) is infinite. Only one possi- 
bility remains, namely that both summands are finite, and this 
proves that every measurable subset of a set of finite signed 
measure has finite signed measure. J 


Theorem B. [Jf u is a signed measure and {E,} is a disjoint 
sequence of measurable sets such that | p(s: En) | < ~, then 
the series )nuy u(En) is absolutely convergent. 


Proof. Write 


E,* = a if u(E,) 2 0, 
O if pw(E,) <9, 

and 
ses ie if u(En) s 0, 
" “10 if w(E,) > 0. 

Then 

u(Ure1 E,.*) = pn w(E,,*) 

and 


a(Ure1 E,-) = pBaan w(E,~). 


Since the terms of both the last written series are of constant 
sign, and since » takes on at most one of the values +0 and —», 
it follows that at least one of these series is convergent. Since 
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the sum of the two series is the convergent series }ona; u(En), 
it follows that they both converge, and, since the convergence of 
the series of positive terms and the series of negative terms is 
equivalent to absolute convergence, the proof of the theorem is 
complete. Jf 


TheoremC. Jf wis a signed measure, if {E,} is a monotone 
sequence of measurable sets, and if, in case {E,} is a decreasing 
sequence, | u(En) | < © for at least one value of n, then 


u(lim, E,) = lim, p(Z,). 


Proof. The proof of the assertion concerning increasing se- 
quences is the same as for measures (replacing { E,,} by the disjoint 
sequence {E; — E;_,} of differences, cf. 9.D); the same is true 
for decreasing sequences (reduction to the preceding case by 
complementation, cf. 9.E), except that Theorem A has to be used 
to ensure the finiteness of the subtrahends that occur. Jf 


(1) The sum of two [totally] o—finite measures is a [totally] o—finite measure. 
Is this assertion valid for infinite sums? 

(2) A complex measure on the class of all measurable sets of a measurable 
space is a set function » such that, for every measurable set E, u(E) = wi(Z) + 
iuo(E), where i = VV —1, and where 1 and py are signed measures in the sense 
of this section. Are Theorems A, B, and C true for complex measures? 

(3) If a signed measure u is the difference of two measures in two ways, 
B= M1 — Me and wp = ¥; — vg, then is it true that wu: = ¥) and pe = v2? 

(4) The fact that a signed measure assumes at most one of the values +o 
and —co follows from the requirement of additivity. (Hint: if w(Z) = +o and 
u(F) = —o, then the right side of at least one of the relations 


w(E) = w(E — F)+H(EN F), 


F u(F) = w(F — E) +u(E Nf F), 
an 
B(E AF) = w(E — F) + pF — £) 


és indeterminate.) 


§ 29. HAHN AND JORDAN DECOMPOSITIONS 


If » is a signed measure on the class of all measurable sets of a 
measurable space (X,S), we shall call a set E positive (with respect 
to w) if, for every measurable set F, E MN F is measurable and 
u(E N F) 2 0; similarly we shall call Z negative if, for every 
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measurable set F, EMF is measurable and p(EN F) < 0. 
The empty set is both positive and negative in this sense; we do 
not assert that any other, non trivial, positive sets or negative 
sets necessarily exist. 


Theorem A. Jf u is a signed measure, then there exist two 
disjoint sets A and B whose union is X, such that A ts positive 
and B is negative with respect to p. 


The sets 4 and B are said to form a Hahn decomposition of X 
with respect to pu. 

Proof. Since » assumes at most one of the values + and 
—oo, we may assume that, say 


—o <p(E) s ~ 


for every measurable set E. Since the difference of two negative 
sets, and a disjoint, countable union of negative sets are obviously 
negative, it follows that every countable union of negative sets 
is negative. We write 6 = inf u(B) for all measurable negative 
sets B. Let {B,} be a sequence of measurable negative sets such 
that lim; u(B;) = 6; if B = U7, B,, then B is a measurable 
negative set for which yu(B) is minimal. 

We shall prove that the set 4 = X — B is a positive set. 
Suppose that, on the contrary, Ey is a measurable subset of 4 
for which p(Eo) <0. The set Ey cannot be a negative set, for 
then B U Ey would be a negative set with a smaller value of u 
than u(B), which is impossible. Let k, be the smallest positive 
integer with the property that Ey contains a measurable set F, 


1 
for which y(£;) 25 —. (Observe that, since u(Eo) <0, u(Ep) 
and u(£,) are both aad ) Since 
| 
u(Eo — Ex) = w(Eo) — w(F1) S w(Zo) — a <0, 
1 
the argument just applied to Eo is applicable to Ey — £;, also. 
Let kp be the smallest positive integer with the property that 


Eo — E, contains a measurable subset Eg with n(Z2) = —, and 


ke 
proceed so on ad infinitum. Since wis finite valued for measurable 
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1 
subsets of Ey (28.A), we must have lim, i 0. It follows that, 
for every measurable subset F of ° 


Fy = Ey — Uj-1 E;, 


we have u(F) <0, ie. that Fo is a measurable negative set. 
Since F is disjoint from B, and since 


u(Fo) = u(Eo) — Soj-1 u(E;) S u(Eo) < 9, 


this contradicts the minimality of B, and we conclude that the 
hypothesis u(Zo) < Ois untenable. J 

It is not difficult to construct examples to show that a Hahn 
decomposition is zot unique. If, however, 


X=4,UB, and X= /A4,U Be 


are two Hahn decompositions of X, then we can prove that, for 
every measurable set £, 


w(EN A;) = (EN A.) and w(EN B,) = w(EN Ba). 
To see this, we observe that 
EN(4,-4) CENA, 
so that u(E N (4; — Ae)) = O, and 
EN (4, — 42) CEN B, 


so that u(EN (A, — 42)) $0. Hence n(EN (4, — A2)) = 0 
and, by symmetry, u(E N (dz — 4,)) = 0; it follows that 


MEN A) = wW(EN (A U A2)) = BE n Ap). 
It follows from this result that the equations 
ut(Z)=u(EN A) and w(E) = —n(EN B) 


unambiguously define two set functions 4+ and uw on the class 
of all measurable sets, called, respectively, the upper variation 
and the lower variation of u. The set function | u|, defined 
for every measurable set E by | u|(E) = »t(E) + u7(B), is the 
total variation of ». (Observe the important notational distinc- 
tion between | » |(Z) and | u(£) |.) 


[Suc. 29] GENERAL SET FUNCTIONS 123 


Theorem B. The upper, lower, and total variations of a 
signed measure w are measures and p(E) = wt(E) — p(B) 
for every measurable set E. If w is [totally] finite or o-finite, 
then so also are n* and uw; at least one of the measures w+ and 
a is always finite. 


Proof. The variations of u are clearly non negative; if every 
measurable set is a countable union of measurable sets for which u 
is finite, it follows from 28.A that the same is true for u* and yuo. 
The equation » = w+ — uw follows from the definitions of wt 
and y~; the fact that u takes on at most one of the values ++ 
and —o implies that at least one of the set functions w* and p~ 
is always finite. Since the countable additivity of w+ and p- 
is evident, the proof is complete. Jf 

It follows from Theorem B that every signed measure is the 
difference of two measures (of which at least one is finite); the 
representation of u as the difference of its upper and lower varia- 
tions is called the Jordan decomposition of u. 


(1) If « is a finite signed measure and if {En} is a sequence of measurable 
sets such that lim, £, exists, (i.e. such that lim sup, Zn = lim inf, E£,), then 


w(limn En) = lima w(E,). 


(2) A finite signed measure, together with its variations, is bounded. For 
this reason finite signed measures are often said to be of bounded variation. 
(3) If u is a signed measure and if E is a measurable set, then 


wt(E) = sup {u(F): ED FeS} and mw-(E) = — inf {u(F): ED FeS}. 


An alternative and frequently used proof of the validity of the Jordan decomposi- 
tion may be given by treating these equations as the definitions of w+ and w-. 
(4) Does the set of all totally finite signed measures on a o-algebra form a 
Banach space with respect to the norm defined by || # || = | u |(X)? 
(5) If (X,S,u) is a measure space and f is an integrable function on X, then 


the set function », defined by »(E) =f Sf (x)du(x), is a finite signed measure, and 
E 


vt(E) =f frau, (BE) =f fod. 


What is | v |(E) in terms of f? 

(6) If wand v are totally finite measures on a o—algebra S and if E is a set in 
S, then, corresponding to every real number ¢, there exists a set 4, in S such that 
4, C E and such that, for every set F in S for which F C 4; [or for which 
FC E — Aj we have o(F) S tu(F), [or (F) 2 tu(F)). 
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(7) If « is a signed measure and f is a measurable function such that / is 
integrable with respect to |u|, then we may write, by definition, 


fra = ffau* — ffdu-. 


This integral has many of the essential properties of the “positive” integrals 
discussed in Chapter V. If p is a finite signed measure, then, for every measur- 
able set E, 


|u|(Z) = sup |f fai |, 


where the supremum is extended over all measurable functions f such that 


lf| 1. 


(8) By the separate consideration of real and imaginary parts, integrals such 
as f fdu may be defined for complex valued functions f and complex measures p; 
(cf. 28.2). Motivated by (7) above, we define the total variation of a finite 
complex measure yu by | w|(Z) = sup If fau |, where the supremum 1s extended 


over all (possibly complex valued) measurable functions f such that |/| S 1. 
What is the relation between | «| and the total variations of the real and imagi- 
nary parts of yu? 


§30. ABSOLUTE CONTINUITY 


Motivated by the properties of indefinite integrals, we intro- 
duced the abstract concept of signed measure, and we showed that 
the abstraction had several of the important properties of the 
concrete concept which it generalized. Indefinite integrals have, 
however, certain additional properties (or, rather, certain rela- 
tions to the measures in terms of which they are defined) that are 
not shared by general signed measures. In a special case we have 
already discussed one such property of very great significance 
(absolute continuity, § 23); we propose now to examine a more 
general framework in which the discussion of absolute continuity 
still makes sense. 

If (X,S) is a measurable space and y and » are signed measures 
on S, we say that » is absolutely continuous with respect to u, 
in symbols » «uy, if v(Z) = 0 for every measurable set E for 
which | »|(E) = 0. In a suggestively imprecise phrase, » Xu 
means that » is small whenever yu is small. We call attention, 
however, to the lack of symmetry in the precise form of the defini- 
tion; the smallness of » is expressed by a condition on its total 
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variation. Our first result concerning absolute continuity asserts 
that this asymmetry is only apparent. 


Theorem A. If y and v are signed measures, then the condt- 
tions 


(a) vp KH, 
(b) vt Kp and v- Kp, 
(c) | v | <«K | L l, 


are mutually equivalent. 


Proof. If (a) is valid, then »(E) = 0 whenever | »|(E) = 0. 
If X = 4 U B is a Hahn decomposition with respect to v, then 
we have, whenever | u|(E) = 0, 


0S|4\(EN 4) s|z|(E) =0 
and 


0S|ul(EN B) <|p|(Z) =0, 
and therefore 
vt(E) = o(EN A) =0 and v (EF) = oA(EN B) = 0; 


this proves the validity of (b). 
The facts that (b) implies (c) and (c) implies (a) follow from 
the relations 


| » |(E) =vt(E)+v-(E) and 08 | r(E)| s | v |(E) 


respectively. §f 

The following theorem establishes the relation between our 
present form of the definition of absolute continuity and the one 
we used (for finite valued set functions) in §23. The theorem 
asserts essentially that another precise interpretation of “y is 
small whenever y is small,” which is apparently quite different 
from the definition of absolute continuity, is in the presence of a 
finiteness condition equivalent to it. 


Theorem B. If v is a finite signed measure and if wis a 
Signed measure such that v &K p, then, corresponding to every 
positive number e, there is a positive number & such that 
ly l(E) < efor every measurable set E for which | »|(E) < 6. 
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Proof. Suppose that it is possible, for some e > 0, to find a 
sequence {E,} of measurable sets such that | u |(E,) <x and 
|v |(E,) 26,2 =1,2,---. If E = lim sup, En, then 


1 
[ul(A) Ss mal wl) <s5> m=1,2, ++, 


and therefore | 1 |(Z) = 0. On the other hand (since » is finite) 
|v |(E) = lim, |» |(En U Eng, U---) 2 lim sup, |v |(Z,) 2 «. 


Since this contradicts the relation » « p, the proof of the theorem 
is complete. 

It is easy to verify that the relation “<” is reflexive (i.e. 
uw Kz) and transitive (i.e. wy Km, and pe Kyuz imply that 
Hi Kg). Two signed measures uw and » for which both » <u 
and uw <» are called equivalent, in symbols p = ». 

The antithesis of the relation of absolute continuity is the rela- 
tion of singularity. If (X,S) is a measurable space and uw and » 
are signed measures on §S, we say that yw and » are mutually singu- 
lar, or more simply that » and » are singular, in symbols p 1 », 
if there exist two disjoint sets 4 and B whose union is X such 
that, for every measurable set E, 4 N E and B Nf E are measur- 
able and |u|(4N E) =|v»|(BN E) =0. Despite the sym- 
metry of the relation, it is occasionally more natural to use an 
unsymmetric expression such as “‘v is singular with respect to p” 
instead of “‘y and » are singular.” 

It is clear that singularity is indeed an extreme form of non 
absolute continuity. If » is singular with respect to uw, then not 
only is it false that the vanishing of |u| implies that of | »|, 
but in fact essentially the only sets for which | v | does not neces- 
sarily vanish are the ones for which | u| does. 

We conclude this section with the introduction of a new nota- 
tion. We have already used the traditional and suggestive “al- 
most everywhere” terminology on measure spaces; this is perfectly 
satisfactory as long as we restrict our attention to one measure 
at a time. Since, however, in the discussion of absolute con- 
tinuity and singularity we have necessarily to deal with severai 
measures simultaneously, and since it is clumsy to say ‘“‘almost 
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everywhere with respect to »” very often, we shall adopt the fol- 
lowing convention. If, for each point x of a measurable space 
(X,S), x(x) is a proposition concerning x, and if yu is a signed 
measure on §, then the symbol 


w(x) [u] or om [u] 


shall mean that a(x) is true for almost every x with respect to 
the measure | »|. Thus, for instance, if f and g are two functions 
on_X, we shall write f = g [u] for the statement that {x: f(x) # 
g(x)} is a measurable set of measure zero with respect to | »|. 
The symbol [u] may be read as “modulo p.” 


(1) If « is a signed measure and f is a function integrable with respect to 
|u|, and if v is defined for every measurable set E by »(E) =f fd (cf. 29.7), 


then v<u. 
(2) Let the measure space (X,S,u) be the unit interval with Lebesgue measure. 
Write F = {x:0 S x S 3}, and let f; and fy be the functions defined by fi(x) = 


2xr(x) — 1 and fo(x) = x. If the set functions y; are defined by n,(Z) = f Sid, 
E 

f= 1, 2, then pea<py. It is not, however, true that w2(E) = 0 whenever 

u,(E) = 0. If 42 were defined by u2(E) =f. (fe — 4)du, then even this stronger 


condition would be satisfied. 

(3) For every signed measure p, the variations w+ and w~ are mutually 
singular, and they are each absolutely continuous with respect to p. 

(4) For every signed measure p, u = | » |. 

(5) If « is a signed measure and E is a measurable set, then |» |(Z) = 0 
if and only if n(F) = 0 for every measurable subset F of E. 

(6) If u and » are any two measures on a o-ring S, then y< yp + ». 

(7) Let fy and fo be integrable functions on a totally finite measure space 
(X,S,u) and let yu; be the indefinite integral of f;,, # = 1,2. If u({x:fi(x) = 0} A 
{x:fo(x) = O}) = 0, then yr = Me. 

(8) Let » be the Cantor function (cf. 19.3), and let wo be the Lebesgue— 
Stieltjes measure, on the Borel subsets of the unit interval, induced by y; 
(cf. 15.9). If is Lebesgue measure, then po and p are mutually singular. 

(9) If 4 and » are signed measures such that » is both absolutely continuous 
and singular with respect to p, then »p = 0. 

(10) If 1, v2, and yw are finite signed measures such that both » and vg are 
singular with respect to pw, then vy = »; + 2 is also singular with respect to pu. 
(Hint: if X = 4, U By and X = 4, U Be are decompositions such that | u | 
is identically zero for measurable subsets of 4; and |»; | is identically zero for 
measurable subsets of By, i = 1, 2, then 


X =[(4,N &) U (4, N By) U (42 N Bi] U (B, N Ba) 


is such a decomposition for yz and ».) 
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(11) If and v are measures on a o-algebra S such that p is finite and v< p, 
then there exists a measurable set E such that X — E is of o—finite measure with 
respect to v, and such that, for every measurable subset F of E, »(F) is either 
0 or ©. (Hint: use the method of exhaustion (cf. 17.3), to find a measurable set 
E with the property that, for every measurable subset F of E, v(F) is either 0 
or %, and such that y(Z) is maximal; another application of the method of ex- 
haustion shows that X — E is of o—finite measure with respect to v.) 

(12) Theorem B is not necessarily true if » is not finite. (Hint: let X be the 
set of all positive integers, and, for every subset E of X, write 


u(Z) = pan ez2-"%, v(E) = oa ex 2") 


§ 31. THE RADON-NIKODYM THEOREM 


Theorem A. Jf u and v are totally finite measures such 
that v K pw and v is not identically zero, then there exists a post- 
tive number ¢ and a measurable set A such that w(A) > 0 and 
such that A is a positive set for the signed measure v — eu. 


Proof. Let X = 4, U B, be a Hahn decomposition with 


1 : 
respect to the signed measure » — — yw, » = 1, 2, ---, and write 
n 


Ao ae, iia Ans Bo iia gan Ba. 


Since By C Bn, we have 
1 
Os »(Bo) = -, H(Bo)» a= 1, 2; mt ty 


and consequently »(By) = 0. It follows that v(4o) > 0 and 
therefore, by absolute continuity, that u(4) > 0. Hence we 
must have u(4,) > 0 for at least one value of 7; if, for such a 


, the requirements of the 


2 le 


value of 7, we write 4 = 4, ande = 


theorem are all satisfied. J 
We proceed now to establish the fundamental result (known 
as the Radon-Nikodym theorem) concerning absolute continuity. 


Theorem B. Jf (X,S,u) is a totally o-finite measure space 
and if a o—finite signed measure v on S is absolutely continuous 
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with respect to p, then there exists a finite valued measurable 
function f on X such that 


»(E) = f fay 


for every measurable set E. The function f is unique in the sense 
that if also v(E) = f gdu, E eS, then f = ¢ (ul. 
E 


We emphasize the fact that / is not asserted to be integrable; 
it is, in fact, clear that a necessary and sufficient condition that 


f be integrable is that » be finite. The use of the symbol fi Sdu 


implicitly asserts, however (cf. § 25), that either the positive or 
the negative part of f is integrable, corresponding to the fact that 
either the upper or the lower variation of » is finite. 

Proof. Since X is a countable, disjoint union of measurable 
sets on which both u and » are finite, there is no loss of generality 
(for both the existence and the uniqueness proofs) in assuming 
finiteness in the first place. Since if » is finite, f is integrable, 
uniqueness follows from 25.E. Since, finally, the assumption 
v <p is equivalent to the simultaneous validity of the conditions 


vt <p and v~ <p, 


it remains only to prove the existence of f in the case in which 
both yp and » are finite measures. 
Let x be the class of all non negative functions /, integrable 


with respect to w, such that f fdu & v(E) for every measurable 
E 


set E, and write 
a = sup | [rawr e x} 
Let {f,} be a sequence of functions in x such that 


lima Sifu =a, 


If E is any fixed measurable set, 7 is any fixed positive integer, 
and gx =f; U---U f;, then E may be written as a finite, disjoint 
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union of measurable sets, E = E, U---U E,, so that ga(x) = 
Fi(*) for x in Ej, 7 = 1, ---, . Consequently we have 


Sandu = Chen [fide s Dh) = 0. 


If we write fo(x) = sup {fa(x): 2 =1, 2, -+-}, then fo(x) = 
lim, gn(x) and it follows from 27.B that foe K and f Sodu =a. 
Since fo is integrable, there exists a finite valued function f such 
that fo =f [u]; we shall prove that if »(E) = »(E) ~ f fa, 


then the measure 79 is identically zero. 

If v9 is not identically zero, then, by Theorem A, there exists a 
positive number e and a measurable set 4 such that n(4) > 0 
and such that 


ew(EN A) SH(EN A) = (EN 4) -f Fi 
EnA 
for every measurable set FE. If g = f+ exa, then 
fisdu=ffdu+ EN A Sf fd +E A) SE) 
E E E-A 


for every measurable set E, so that ge KX. Since, however, 
fedu = [fan + A) > 0, 


this contradicts the maximality of f fdu, and the proof of the 


theorem is complete. J 


(1) If (X,S,u) is a measure space and if »(Z) =f fd for every measurapie 

set E, then 
X = {x: f(x) > 0} U {x: f(x) S 0} 
is a Hahn decomposition with respect to ». 

(2a) Suppose that (X,S) is a measurable space and yp and » are totally finite 
measures on S such thaty<y. If @ = + v and if v(E) =f Jfdi for every 
measurable set E, then 0 S f(x) < 1 [uJ]. 

(2b) If f gdv = {eda for every non negative measurable function g, then 
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v(E) =f 5# for every measurable set E. (Hint: rewrite the hypothesis 
Pe 

- _ - : — XE. 

in the form fg(1 —f)dv = ffedu and, given E, write g if ) 


(3) Let (X,S,u) be the unit interval with Lebesgue measure and let M be a 
non measurable set. Let (1,81) and (a2,82) be two pairs of positive real numbers 
such that a; + 61 = a, + Be: = 1, and let fi; be the extension of u, determined 
by (a;,8;), to the o-ring § generated by S and M, i = 1, 2 (cf. 16.2). There 
exist measurable functions /; and fp such that 


Bi(E) = S Sidjlg and jix(E) = S Sodiis 


for every measurable set E. What are the functions fA; and fo? 

(4) The Radon-Nikodym theorem remains true even if » is only a signed 
measure. (Hint: let X = 4 U B be a Hahn decomposition with respect to 
u and apply the Radon—-Nikodym theorem separately to »y and w+ in 4 and tov 
and wo in B.) 

(5) Let u be a totally o-finite signed measure. Since both wt and wo are 
absolutely continuous with respect to both u and | » |, we have 


u+(E) =f fd =f evan | and p—(E) =f fa =f edu |. 


The functions f+, g+, f-, and g— satisfy the relations f; = g4 [uy] and f. = 
—g-_[u]. What are these functions? 

(6) If w is a signed measure and if »(E) =f feu and | » |(Z) =f gal w| for 
every measurable set EZ, then g = [/| [yl]. : 7 

(7) The Radon-Nikodym theorem remains true even if y is not o-finite, but, 
in this case, the integrand f is not necessarily finite valued. (Hint: it is sufficient 
to consider the case in which v is a measure and y is finite; in this case apply 
30.11.) 

(8) The Radon-Nikodym theorem is not necessarily true if » is not totally 
o-finite, even if » remains finite. (Hint: let X be an uncountable set and let 
S be the class of all those sets which are either countable or have countable 
complements. For every E in S, let 4(Z) be the number of points in E and let 
v(E) be 0 or 1 according as E is countable or not.) 

(9) If (X,S) is a measurable space and y and v are o-finite measures on S 
such that v<zy, then the Radon-Nikodym theorem may be applied to each 
measurable set separately. The question might be raised whether or not a 
function f may be defined once for all on the whole space X so as to serve as a 
suitable integrand simultaneously for every measurable set. The answer is no, 
as the following pathological example shows. 

Let 4 be any uncountable set (with, say, cardinal number a), and let B be a 
set of cardinal number 8 > a. Let X be the set of all ordered pairs (4,4) with 
aeA and be B. It is convenient to call a set of the form {(a,49): ae 4} a 
horizontal line, and a set of the form {(ao,4): 4 B} a vertical line. We shall 
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call a set E full on a horizontal or vertical line Z if L — Eis countable; (cf. 12.1). 
Let S be the class of all sets which may be covered by countably many hori- 
zontal and vertical lines, and which are such that on every horizontal and vertical 
line they are either countable or full. For every E in S Jet u(£) be the number of 
horizontal and vertical lines on which E is full, and let v(Z) be the number of 
vertical lines on which E is full. Clearly » and y are o-finite measures and 


v<<. Suppose now that there exists a function f on X such that (EZ) = f Side 


for every Ein S. It is easy to see that the set {x: f(x) = 0} has to be count 
able on every vertical line and full on every horizontal line. The first require- 
ment implies that the cardinal number of this set is at most @No = a, and the 
second requirement implies that the cardinal number of this set is at least 
B(@ — &) 2B. 

(10) There is a condition on measure spaces, which is more general than 
total o-finiteness and more restrictive than o—finiteness, in the presence of which 
the Radon-Nikodym theorem is still true. The condition is that the space be 
the union of a disjoint class D of measurable sets of finite measure with the 
property that every measurable set may be covered by countably many sets of 
D and a set of measure zero. The following is an example of a non totally 
o-finite measure space satisfying this condition. 

Let X be the Euclidean plane, and let S be the class of all those sets which 
may be covered by countably many horizontal lines and which are Lebesgue 
measurable on each such line. If E is a Lebesgue measurable subset of a hori- 
zontal line, define (EZ) to be the Lebesgue measure of E; for the general E in 
S, » is thereby uniquely determined by the requirement of countable additivity. 

(11) If, in (9) above, B = 4 and the cardinal number of this set is &, (= the 
smallest uncountable cardinal), then the proof breaks down, i.e. there exists in 
that case a subset E of X which is countable on every vertical and full on every 
horizontal line. (Hint: well order 4, i.e. assign to every @ in 4 an ordinal 
number £(a) < Q(= the smallest uncountable ordinal) so that the correspondence 
is one to one between all points of 4 and all ordinals less than Q, and write 
E = {(a,d): &(2) > &()}. 

(12) If w is a totally finite measure and »(Z) ={ Jfdp for every measurable 
set E, then the set e 


BO) = {x fe) S48 


is a negative set for the signed measure »v — ¢u; (cf. (1) above). A proof of the 
Radon-Nikodym theorem may be based on an attempt to reconstruct f from 
the sets Bit); (cf. 18.10). The main complication of this approach is the non 
uniqueness of negative sets. A tool for partially dealing with this complication 
is to select B(¢t), for each ¢, so as to maximize the value of u(B(s)). 


§32. DERIVATIVES OF SIGNED MEASURES 


There is a special notation for the functions which occur as 
integrands in the Radon—Nikodym theorem, which is frequently 
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very suggestive. If u is a totally o—finite measure and if »(Z) = 


f Jdu for every measurable set E, we shall write 
E 


dv 


f=— or dv=fadp. 


All the properties of Radon—Nikodym integrands (which we may 
also call Radon-Nikodym derivatives), which are suggested by 
the well known differential formalism, correspond to true theorems. 
dv, + v2) dy, dv, F 

as = Ta + =) » while 
others are more or less deep properties of integration. Examples 
of the latter kind of result are the chain rule for differentiation 
and, as an easy corollary, the substitution rule for the differentials 
occurring under an integral sign; both these results are precisely 
stated and proved below. It is, of course, important to remember 


Some of these are trivial (ex. 


: ee ee : 
that a Radon—Nikodym derivative . is unique only a.e. with 
T) 


respect to w, and that, therefore, in the detailed verbal interpreta- 
tion of a differential formula, frequent use has to be made of the 
qualifying ‘“‘almost everywhere.” 


Theorem A. Jf d and u are totally o-finite measures such 
that up Kd and if v is a@ totally o—finite signed measure such that 
v <p, then 


Proof. Since the validity of the desired equation for the upper 
and lower variations of » implies its validity for » itself, we may 
and do assume that » is a measure; for simplicity of notation we 

. ad dp 
write hz f and 7 
25.D that f = 0 [u] and therefore that there is no loss of generality 
in assuming that f is everywhere non negative. 

Let {fn} be an increasing sequence of non negative simple 


= g. Since v is non negative, it follows from 
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functions converging at every point to f, (20.B); then, by 27.B, 
we have 


lim, J ae f fa» and lim, if fatdd = I Sadr 


for every measurable set EF. Since, for every measurable set F, 
[xedu= ENF) =f gan =f xegar, 
E EnF E 


1t follows that f fads = [ fosdr, n = 1,2,---, and therefore that 
E E 
E) =| fdp = dy. 
“(E) =f fdu = f fear. A 


Theorem B. Jf d and u are totally o-finite measures such 
that up «KX, and if f is a finite valued measurable function for 


d 
which f ‘fdu is defined, then f js f f - dn. 


Proof. We write v(Z) = f fdu for every measurable set E, 
E 


d 
and apply Theorem A. It follows that »(Z) = f f no for every 
E 


measurable set E; the desired result follows by putting E = X. J 

Our next and final result concerning the relations among signed 
measures treats the Lebesgue decomposition of a totally o-finite 
signed measure into an absolutely continuous part and a singular 
part with respect to another totally o-finite signed measure. 


Theorem C. Jf (X,S) is a measurable space and y and v 
are totally o—finite signed measures on S, then there exist two 
uniquely determined totally o—finite signed measures vo and y 
whose sum is v, such thatvo 1 wandy Kp. 


Proof. As usual we may assume that u and v are finite. Since 
v;(i = 0, 1) will be absolutely continuous or singular with respect 
to » according as it is absolutely continuous or singular with 
respect to| u|, we may assume that wis a measure. Since, finally, 
we may treat vt and »~ separately, we may also assume that v 
is a measure. 
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The proof of the theorem for totally finite measures is a useful 
trick, based on the elementary observation that » is absolutely 
continuous with respect to » +». There exists, accordingly, a 
measurable function f such that 


»(E) = [faut [fw 


for every measurable set E. Since 0 S »(E) S w(E) + v(B), 
we have OS f S1[u+ 7] and therefore OS fs 1[»]. If we 
write 4 = {x: f(x) = 1} and B = {x:0 S f(x) < 1}, then 


(A) = f du + fd = wld) + od) 


and therefore (since » is finite) u(A) =0. If 
w(E) = (EN A) and » (EF) = (EN B) 


for every measurable set E, then it is clear that » 1 mu; it remains 


to prove that », Kp. 
If u(E) = 0, then 


fe =F (1B) =f fe 


and _ therefore f aC —f)dv=0. Since 1—f 20, it fol. 


lows that (EZ) = »(E N B) = 0; this completes the proof of the 
existence of v9 and 7. 

Ify = vy + » andy = x + 7, are two Lebesgue decompositions 
of v, then v9 — % = i — ». Since vp — dp is singular (cf. 30.10) 
and 7, — » is absolutely continuous with respect to y, it follows 
that Yo = Vo and y= V5 (cf. 30.9). i 


(1) Using the concept of integration with respect to a signed measure, the 
definition of Radon—Nikodym derivatives may be extended to the case in which 
wis a signed measure, and Theorem A remains true if X and yu are signed measures. 
(Hint: consider a Hahn decomposition with respect to each of the three signed 
measures J, #, and », and construct the decomposition of X into the eight sets ob- 
tained by taking one set from each decomposition and forming the intersection 
of these three sets. On measurable subsets of each of the eight sets each of the 
functions, A, 4, and », is of constant sign and therefore, after an obvious, slight 
modification, Theorem A applies.) 
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(2) If » and p are totally o-finite signed measures such that uw = py, then 


(3) If « and » are totally o-finite signed measures such that »<uy, then 


dy 
v (jez (x) = o}) = 0. 
(4) If uo, wi, and ue are totally finite measures, and if dup = fid(uo + w1) = 


Sod(uo + we) = fd(uo + H1 + me), then we have, almost everywhere with respect 
to wo + #1 + Be, 


Silx)folx) 


S(*) = Lar 0) = AE if filx)fo(x) 0, 
0 


if fi(x) = fo(x) = 0. 
(5) Given two sequences {u,! and {v_} of totally finite measures, write 


Bn = Dita Mis Pn = Doha, B= Shims, v= Srv, 


and assume that yw and »p are finite measures. If ¥n<fn, 1 = 1, 2, ---, then 
v<p and 
lim 22 = © fu] 
"din duo” 


The proof of this assertion may be based on the following lemmas. 

(5a) If {E£,} is a sequence of measurable sets such that #.(E,) = 0,2 = 1,2, 
ae then p(lim suPn E,) = 0. (Hint: BAUr=n Ex) Ss Den i: (Ex).) 

(5b) If {ga} and {yn} are sequences of functions such that on = Wn [Hnl, 


n= 1,2, ---, then, for a.e. x [ul], 
lim supa $.(x) = lim supp Wa(x) and lim inf, oa(x) = lim inf, p,(x). 


(Hint: write En = {x: ba(x) % Wa(x)} and apply (Sa).) 
In view of the result (5b) it is sufficient to prove (5) for any fixed determina- 


tion of the derivatives as If 
dfin 


an din = 
du =fn and aut n= 1, 2, ’ 


then it follows from Theorem A that one such determination is 


don frticthn 
din gi t---+ En 


(5c) eaifn -¢ and >(2.1gn = 1[uJ. (Hint: since 


Di ui(E) = f (gi t+*+ gn)du 


[ua], 2 = 1,2, +++. 


and 


Dt. 1 ¥i(E) =f tet faldu, n=1,2,--°, 


the desired result follows from 27.B and 25.E.) 


Chapter VII 


PRODUCT SPACES 


§ 33. CARTESIAN PRODUCTS 


If X and Y are any two sets (not necessarily subsets of the 
same space), the Cartesian product X x Y is the set of all ordered 
pairs (x,y), where x e X andy e Y. The best known example of a 
Cartesian product is the Euclidean plane, which is most often 
viewed as the product of two coordinate axes. Most of the 
development in the sequel uses the words and concepts suggested 
by this example. Thus, for instance, if 4c X and B CY, we 
shall call the set E = 4 X B (a subset of X K Y) a rectangle 
and we shall refer to the component sets 4 and B as its sides. 
(Observe that our usage here differs from the classical terminology 
which speaks of rectangles only if the sides are intervals.) 


Theorem A. 4 rectangle is empty if and only if one of its 
Sides is empty. 
Proof. If 4x Bx0, say (x,v)e4 xX B, then xe 4 and 
y © B,so that 4 #0 and B x0. If, on the other hand, neither 
A nor B is empty, then there is a point (x,y) such that (x,y) © 
AXB,sothatsxX BHO. § 


Theorem B. Jf Fi, = 4; X B, and E, = Az X Bz are 
non empty rectangles, then Ey C Ex if and only if 


Ay Cc Ae and B, Cc Bz. 
Proof. The “if” is obvious. To prove the converse, let (x,y) 


be a point in 4, X B, and suppose that there exists a point *: 
137 
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in A, such that x, e’ 42. Then 
(1,7) eA, X B, and (x159) e’ A,X Bo; 


it follows that no such point x, can exist and therefore 4, C 42. 
The same proof with only notational changes shows that 
B, (ca Bo. | 


Theorem C. Jf 4; X Bi = Ae X Be is a non empty rec- 
tangle, then Ay = Az and B, = Bo. 


Proof. It follows from Theorem B that 
4A, CA4,cC MH and B, c B.c B,. | 


Theorem D. Jf E=AXB, A, = 4, X By, and En = 
42 X Bz are non empty rectangles, then a necessary and suffi- 
cient condition that E be the disjoint union of Ey and Ep, is 
that either A is the disjoint union of A, and Ao, and B = By = 
Bo, or else B is the disjoint union of By and B., and A = A, = 
42. 


Proof. We prove first that the condition is necessary. Since 
E, cE and E, c E£, it follows from Theorem B that 4, Cc 4 
and 4,C A, and therefore that 4, U 4, Cc 4; similarly 
By U Bo cB. Since 


E, U E, c (4, U 42) X (B, U Be), 


it follows that 7c 4, U 4, and Bc B, U By, and therefore 
A= A4,U A, and B = B, U Bo. Since, finally, a similar argu- 
ment shows that 


0=£,N FE. > (4, N Az) X (B, N Be), 


it follows from Theorem A that at least one of the two sets 
4, 1 A, and B, N Bz is empty. 

Suppose, for instance, that 4, M 42 = 0; we are to show that 
in this case B = By = B,. (The case B, N By = O is treated 
similarly.) Suppose on the contrary that there exists a point 
y in B— B,. Then, if x is any point in 4, we have (x,y) eZ, 
but (since y e’ B,), (x,y) e’ Ey, and (since x e’ Ae), (x,y) © Es. 
Since this contradicts the assumption E = E, U E), it follows 
that B — B, = 0 and also, by a similar argument, B — Bz = 0. 
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The sufficiency of the condition is easier. If, for instance, 4 
is the disjoint union of 4; and 4, and B = B, = B,, then 4d D Ay, 
AD 42,B> B,,B > B2,sothat ED E, U Ey. Also if (*,y) © E, 
then 

(x,y) eZ, or (wy) e & 


according as x © 4; or « © 42, so that E is indeed the disjoint 
union of EZ, and Ey. §j 


Theorem E. Jf S and T are rings of subsets of X and Y 
respectively, then the class R of all finite, disjoint unions of 
rectangles of the form A X B, where A eS and B eT, is a ring. 


Proof. We observe first that the intersection of two sets of 
the form 4 X B is another set of that form. If either of the two 
given sets, or their intersection, is empty, this result is trivial. If 


E, = 4.X Bi, E, = 42X B, and (x,y) e EN Be, 
then x e 4, N A, and y e« B, N Bs, so that 
FE nN EB, ex (Ay nN Ap) xX (By nN Bo). 


On the other hand, by Theorem B, (4, N 42) X (Bi N Be) is 
contained in E, and E, and therefore in E, N Eo, so that 


A a E, = (A, nN A) x (B, nN Bo). 


Since S and T are rings, 4, N 4, eS and Bi N Be eT. It follows 
immediately that the class R is closed under the formation of 
finite intersections. 

Since 


(4, X Bi) — (42 X Be) = 
= (A n 42) x (By — B,)) U (A = 42) x Byl, 


we see that the difference of two sets of the given form is a disjoint 
union of two other sets of that form; since 


Ur: = Ui: F; = Ui-1 Ni (E; — F)), 


it follows, using the resilt of the preceding paragraph, that the 
class R is closed under th, formation of differences. Since R is 
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obviously closed under the formation of finite, disjoint unions, 
the proof of the theorem is complete. J 

Suppose now that in addition to the two sets X and Y we are 
also given two o-rings S and T of subsets of X and Y respectively. 
We shall denote by S X T the o-ring of subsets of X X Y gener- 
ated by the class of all sets of the form 4 X B, where 4 eS 
and B eT. 


Theorem F. Jf (X,S) and (Y,T) are measurable spaces, 
then (X X Y,S X T) ts a measurable space. 


The measurable space (X X Y, S XT) is the Cartesian 
product of the two given measurable spaces. 

Proof. If (x,y) eX X Y, then there exist sets 4 and B such 
that xe deS and yeBeT; it follows that (~y)ed4X Be 
SxT. J 

We observe that this is the first time we ever referred to the 
fact that a measurable space is the union of its measurable sets; 
in the present chapter we shall make essential use of this property 
of measurable spaces. 

We shall frequently use the concept of measurable rectangle. 
Two equally obvious and natural definitions of this phrase suggest 
themselves. According to one, a rectangle in the Cartesian 
product of two measurable spaces (X,S) and (Y,T) is measurable 
if it belongs to S X T, and, according to the other, 4 X B is 
measurable if 4eS and BeT. It is an easy consequence of 
the results we shall obtain that for non empty rectangles the two 
concepts coincide; for the time being we adopt the second of our 
proposed definitions. We may say, accordingly, that the class 
of measurable sets in the Cartesian product of two measurable 
spaces is the o-ring generated by the class of all measurable 
rectangles. 


(1) The intersection of any countable class of [measurable] rectangles is a 
[measurable] rectangle. Does this statement remain true if the word ‘“count- 
able” is omitted? 

(2) The “only if” part of Theorem B, Theorem C, and the necessity of the 
condition in Theorem D are all false for empty rectangles. 

(3) Under the hypotheses of Theorem E, the class P of all sets of the form 
4X B, where 4¢S and Be T, isa semiring. Is this statement true if S and T 
are not necessarily rings, but merely semirings? 
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(4) If the rings S and T in (3) each contain at least two different non empty 
sets, then P is not a ring. 

(5) A necessary and sufficient condition that S X T be a o-algebra is that 
both S and T be o-algebras. 

(6) If (X,S) and (Y,T) are measurable spaces, then every measurable set in 
X X Y is contained in a measurable rectangle. (Hint: the class of all those 
sets which may be covered by a measurable rectangle is a o-ring.) 


§ 34. SECTIONS 


Let (X,S) and (Y,T) be measurable spaces and let (X x Y, 
S X T) be their Cartesian product. If E is any subset of X X Y 
and x is any point of X, we shall call the set E, = { y: (x,y) e E} 
a section of F, or, more precisely, the section determined by w. 
At times when it is important to call attention not so much to 
the particular point which determines the section as merely to 
the fact that the section is determined by some point of the space 
X (and is therefore a subset of Y), we shall use the phrase 
X-section. The main point is to distinguish such a section from 
a Y-section determined by a point y in Y; the latter is defined, of 
course, as the set EY = {x: (x,y) eZ}. We emphasize that a 
section of a set in a product space is vof a set in that product space 
but a subset of one of the component spaces. 

If f is any function defined on a subset E of the product space 
X X Y and » is any point of X, we shall call the function /,, 
defined on the section E, by 


Se(y) =f), 


a section of f, or, more precisely an X-section of f, or, still 
more precisely, the section determined by x. The concept of a 
Y-section of f, determined by a point y in Y is defined similarly 


by f(x) = f(x,y). 


Theorem A. Every section of a measurable set is a measur- 
able set. 


Proof. Let E be the class of all those subsets of X & Y which 
have the property that each of their sections is measurable. If 
E = AX B is a measurable rectangle, then every section of E 
is either empty or else equal to one of the sides, (4 or B according 
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as the section is a Y-section or an X-section), and therefore 
E cE. Since it is easy to verify that E is a o-ring, it follows that 
SXT cE. J 


Theorem B. Every section of a measurable function is a 
measurable function. 


Proof. If / is a measurable function on X X Y, if x is a point 
of X, and if M is any Borel set on the real line, then the measur- 
ability of N(fz) N fz *(M) follows from Theorem A and the 


relations 


f2~(M) 


i} 


{yz fe(y) eM} = {y:f(x,y) eM} = 
{y: (ey) ef 1(M)} = (f71(M))-. 


(Observe that N(fz) = (N(/))z.) The proof of the measurability 
of an arbitrary Y-section of fis similar. J 


tt 


(1) If x is the characteristic function of a subset E of X X Y, then xz and x* 
are the characteristic functions of Ez, and E¥ respectively. If, in particular, x 
is the characteristic function of a rectangle 4 X B, then 


xy) = xa(x)xa(y). 


Every section of a simple function is a simple function. 

(2) Let X = Y = any uncountable set, and S = T = the class of all count- 
able subsets. If D = {(x,y): x = y} is the “diagonal” in X x Y, then every 
section of D is measurable but D is not; in other words the converse of Theorem 
A is not true. 

(3) If an extended real valued function f defined on the Cartesian product of 
two measurable spaces X and Y has the property that, for every Borel set M/ on 
the real line, f~1(M) intersects every measurable set in a measurable set, then 
every section of f also has that property. Does this assertion remain valid if 
the definition of measurable space is altered by omitting from it the requirement 
that the space be the union of its measurable sets? What are the implication 
relations between this property and measurability? 

(4) Anon empty rectangle is a measurable set if and only if it is a measurable 
rectangle. (Hint: if 4 X B is measurable, then every section of 4 X B is 
measurable.) 

(5) Let (X,S) be a measurable space such that X eS (i.e. such that S is a 
s—algebra); let Y be the real line, and let T be the class of all Borel sets. If/ is 
a real valued, non negative function on X, then the upper ordinate set of / is 
defined to be the subset 


V*(f) ={(xy):eX, OS y Sf(x)} 
of X X Y, and the lower ordinate set of / is 
Pe) = {(x,y): «eX, Osy <f(x)}. 
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(Observe that, for instance, the lower ordinate set of the function identically 
equal to zero is empty.) The following considerations are at the basis of an 
alternative treatment of measurable functions. 

(Sa) Iff is a non negative simple function, then V*(f) and /4(f) are measur- 
able. (Hint: each of the sets is the union of a finite number of measurable 
rectangles.) 

(5b) If f and g are non negative functions such that /(x) S g(x) for all «x, 
then V*(f) C V*(g) and Vs(f) C Vx (g). 

(5c) If {fa} is an increasing sequence of non negative functions converging 
at every point tof, then {/«(/,)} is an increasing sequence of sets whose union 
is V«(f); similarly if {f,} is decreasing to f, then {*(/,)} is a decreasing se- 
quence of sets whose intersection is V*(f). 

(5d) If f is a non negative measurable function, then V*(f) and V4(f) are 
measurable sets. (Hint: if f is bounded, then there exist sequences {g,} and 
{na} of simple functions such that 


0S gn 3 8ng1 Sf Shanti Shan, m= 1,2,--° 


and such that lima gn = lima fn = f.) 

(Se) If E is any measurable set in X X Y, and if @ and 8 are real numbers 
such that a > 0, then the set {(x,y): (x,ay + 8) © E} is a measurable subset of 
X XY. (Hint: the conclusion is true if E is a measurable rectangle, and the 
class of all sets for which the conclusion is true is a o-ring.) 

(5f) If f is a non negative function such that V*(/) [or Y4(/)] is measurable, 
then f is measurable. (Hint: it is sufficient, for the proof of the unparenthetical 
statement, to show that {x: f(x) > c} is measurable for every positive real 
numbere. If E = V*(f), then 


Us-1 | (9): (« ty + c) eE,y> of = {(%y):S(%) > 9 > 0}; 


the desired result follows from the fact that the sides of a measurable rectangle 
are measurable.) 

(5g) If the graph of a (not necessarily non negative) function f is defined as 
the set {(x,¥):f(~) = y}, then the graph of a measurable function is a measur- 
able set. 


§35. PRODUCT MEASURES 


Continuing our study of Cartesian products, we turn now to the 
case where the component spaces are not merely measurable 
spaces but measure spaces. 


Theorem A. Jf (X,S,u) and (Y,T,v) are o-finite measure 
spaces, and if E is any measurable subset of X X Y, then the 
functions f and g, defined on X and Y respectively by f(x) = 
v(E,) and g(y) = u(E*), are non negative measurable func- 


tions such that f Sap = f gdv. 
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Proof. If 'M is the class of all those sets E for which the con- 
clusion of the theorem is true, then it is easy to see that M is 
closed under the formation of countable, disjoint unions. We 
observe that the o-finiteness of » and » implies that every set in 
S X T may be covered by a countable disjoint union of measur- 
able rectangles, both sides of each of which have finite measure. 
If, therefore, we could prove that every measurable subset of 
every measurable rectangle with sides of finite measure belongs 
to M, it would follow (as stated) that every measurable set belongs 
to M. In other words, we have reduced the proof to the case of 
finite measures; we shall complete the proof (for finite measures) 
by showing that every measurable rectangle (and therefore every 
finite, disjoint union of measurable rectangles) belongs to M, 
and that M is a monotone class. 

If E = 4 XB is a non empty measurable rectangle, then 
f = »(B)xa and g = u(A)xez. It follows that f and g are measur- 


able and that f Miu Dek = f eae: 


The fact that M is a monotone class is a consequence of the 
standard theorems on the integration of sequences of functions, 
specifically 26.D and 27.B. (The finiteness of the measures yu 
and » is used in justifying the application of these results.) 

Since the class of all finite, disjoint unions of measurable rec- 
tangles is a ring (33.E), and since, by definition, the class of 
measurable sets is the o-ring generated by this ring, it follows 
(6.B) that every measurable set is in M, and the proof of the 
theorem is complete. Jf 


Theorem B. Jf (X,S,u) and (Y,T,v) are o-finite measure 
spaces, then the set function \, defined for every sete EinS XT 
by 

NE) = frEsdduls) = fulEd(), 


is a o-—finite measure with the property that, for every measurable 
rectangle A X B, 


(4 X B) = w(A)-v(B). 


The latter condition determines \ uniquely. 
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The measure } is called the product of the given measures yu 
and », in symbols \ = u X v; the measure space (X X Y,S xX T, 
u X v) is the Cartesian product of the given measure spaces. 

Proof. The fact that \ is a measure is a consequence of the 
theorem on the integration of monotone sequences (27.B; cf. also 
27.2). The o-finiteness of \ follows from the fact that every 
measurable subset of X X Y may be covered by countably many 
measurable rectangles of finite measure; uniqueness is implied 


by 13.A. fj 


(1) Let X = Y be the unit interval, and Jet S = T be the class of Borel sets; 
let u(E) be the Lebesgue measure of E, and let »(E) be the number of points in 
E. If D = {(x,y): « = y}, then D is a measurable subset of X X Y such that 


fi v(D,)du(x) = 1 and f u(D*)dv(y) = 0. In other words, Theorem A is not 


true if the condition of o—finiteness is omitted. 

(2) The Cartesian product of two o-finite and complete measure spaces 
need not be complete. (Hint: let X = Y be the unit interval, let 1 be a non 
measurable subset of X, let y be any point of Y, and consider the set M X { y}; 
cf. 34.4.) 

(3) Suppose that (X,S,u) is a totally o-finite measure space and that (Y,T,») 
is the real line with T = the class of all Borel sets and vy = Lebesgue measure; 
let A be the product measure » Xv. We have already seen (34.5) that for any 
non negative, measurable function, and a fortiori for any non negative, integrable 
function f on X, the ordinate sets V*(f) and 4 (/) are measurable subsets of 


X X Y; we now assert that A(’,(/)) = A(Y*(/)) = {fdu. (Hint: in view 


of the known results on approximation of functions by simple functions and 
integration of sequences of functions, it is sufficient to establish the equation 
for simple functions /.) This equation is sometimes used, in an alternative 


approach to integration theory, as the definition of f du; it is a precise formula- 


tion of the statement that “the integral is the area under the curve.” 

(4) Under the hypotheses of (3), the graph of a measurable function has meas- 
ure zero. (Hint: it is sufficient to consider non negative, bounded, measurable 
functions on totally finite measure spaces, and to these the result of (3) applies.) 

(5) If (X,S,u) and (Y,T,y) are o-finite measure spaces, and if X = yw X », 
then, for every set E in H(S X T), A*(E) is the infimum of sums of the type 
Ddov-1 (En), where {E,} is a sequence of measurable rectangles covering E. 
(Hint: cf. 33.3, 10.A, and 8.5.) 


§ 36. FUBINI’S THEOREM 


In this section we shall study the relations between integrals 
on a product space and integrals on the component spaces. 
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Throughout this section we shall assume that 


(X,S,u) and (Y,T,») are o—finite measure spaces and } is the 
product measure p X vy onS X T. 


If a function 4 on X X Y is such that its integral is defined 
(i.e. if, for instance, / is an integrable function or a non negative 
measurable function), then the integral is denoted by 


faGerarey) or fal y)du X ») 9) 


and is called the double integral of 4. If 4, is such that 


f he(y)dv(y) = fx) 


is defined, and if it happens that f Jadu is also defined, it is cus- 


tomary to write 
fteu = ff HCe,rrdo(ydutx) = fdute) [oC 9)do(9). 


The symbols f if A(x, y)du(x)dv(y) and f dv(y) f h(x, y)du(x) are de- 
fined similarly, as the integral (if it exists) of the function g on 
Y, defined by g(y) = f h¥(x)du(x). The integrals fi hdudv and 


hdvdu are called the iterated integrals of 4. To indicate the 


double and iterated integrals of 4 over a measurable subset E 
of X X Y, ie. the integrals of xz, we shall use the symbols 


f hdd, fi f Vdaae and fi ff aed: 


Since X-sections (of sets or functions) are determined by points 
in X, it makes sense to assert that a proposition is true for almost 
every X-section, meaning, of course, that the set of those points 
x for which the proposition is not true is a set of measure zero in X. 
The phrase ‘‘almost every Y~section” is defined similarly; if a 
Proposition is true simultaneously for a.e. X-section and a.e. 
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Y-section, we shall simply say that it is true for almost every 
section. 
We begin with an elementary but important result. 


Theorem A. 4 necessary and sufficient condition that a 
measurable subset E of X X Y have measure zero is that almost 
every X-section [or almost every Y—section| have measure zero. 


Proof. By the definition of product measure we have 


fr@naute), 
(EZ) = 
fi u(E¥) doy). 


If \(Z) = 0, then the integrals on the right are in particular 
finite and hence (by 25.B) their non negative integrands must 
vanish a.e. If, on the other hand, either of the integrands 
vanishes a.e., then \(Z) = 0. J 


Theorem B. Jf h is a non negative, measurable function on 
X X Y, then 


frau x ») = [frauds = [[ravds. 


Proof. If 4 is the characteristic function of a measurable set 
E, then 


frGer)do(y) = Es) and fals,y)duls) = wl, 


and the desired result follows from 35.B. In the general case we 
may find an increasing sequence {4,} of non negative simple 
functions converging to A everywhere, (20.B). Since a simple 
function is a finite linear combination of characteristic functions, 
the conclusion of the theorem is valid for every 4, in place of A. 


By 27.B, lim, f awe f hides TEFG) = f GeO) shen 


it follows from the properties of the sequence {/,} that {fn} 
is an increasing sequence of non negative measurable functions 
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converging for every x to f(x) = f h(x,y)dv(y); (cf. 27.B). Hence 


f is measurable (and obviously non negative); one more applica- 
tion of 27.B yields the conclusion that 


lima [fadu = [fdu. 


This proves the equality of the double integral and one of the 
iterated integrals; the truth of the other equality follows simi- 
larly. 

Both Theorems A and B are sometimes referred to as parts of 
Fubini’s theorem; the following result is, however, the one most 
commonly known by that name. 


Theorem C. If h is an integrable function on X X Y, then 
almost every section of h is integrable. If the functions f and g 


are defined by f(x) = { H(x,y)dv(y) and gy) = [H(x,y)dule), 
then f and g are integrable and 


fru x ») = [fas = [sas 


Proof. Since a real valued function is integrable if and only 
if its positive and negative parts are integrable, it is sufficient to 
consider only non negative functions 4. The asserted identity 
follows in this case from Theorem B. Since, therefore, the non 
negative, measurable functions f and g have finite integrals, it 
follows that they are integrable. Since, finally, this implies that 
f and g are finite valued almost everywhere, the sections of 4 
have the desired integrability properties, and the proof is com- 
plete. Jf 


(1) Let X be a set of cardinal number &y, let S be the class of all countable 
sets and their complements, and, for 4 in S, let u(4) be 0 or 1 according as 4 
is countable or not. If (Y,T,v) = (X,S,u), if Z is a set in X X Y which is count- 
able on every vertical line and full on every horizontal line (cf. 31.11), and if & 
is the characteristic function of EZ, then 4 is a non negative function such that 


facer due) =1 and fAsr)dr(y) = 0. 


Why is this not a counter example to Theorem B? 
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(2) If (X,S,u) and (Y,T,v) are the unit interval with Lebesgue measure, and 
if E is a subset of X X Y such that Ez and X — E are countable for every x 
and y (cf. (1)), then E is not measurable. 

(3) The following considerations indicate an interesting extension of the 
results of this section. Let (X,S,u) be a totally finite measure space and let 
(Y,T) be a measurable space such that Ye T. Suppose that to almost every 
x in X there corresponds a finite measure vz on T so that if (x) = vz(B), then, 
for each measurable subset B of Y, ¢ is a measurable function on X. If »(B) = 


vz(B)du(x), if g is a non negative measurable function on Y, and if f(x) = 


f gl y)dv2(), then/ is a non negative measurable function on X and f Sau = f gav. 


(4) The proof of Fubini’s theorem sometimes appears to be slightly more 
complicated than the one we gave—the complication is caused by completing 
the measure A. In other words, the theorems of this section remain true if 
is replaced by X. (Hint: every function which is measurable (S X T) is equal 
a.e. [X] to a function which is measurable (S X T); (cf. 21.1)). 

In (5)-(9) below, we shall assume that the measure spaces (X,S,u) and 
(Y,T,v) are totally finite. It is easy to verify that the results obtained may be 
extended to totally o-finite measure spaces, and, therefore, to each measurable 
set in the product of two o—finite measure spaces. 

(5) If E and F are measurable subsets of X X Y such that »(Z,) = v(F,) 
for [almost] every x in X, then A(Z) = X(F). (Certain usually not rigorously 
stated special cases of this assertion are known as Cavalieri’s principle.) 

(6) If f and g are integrable functions on X and Y respectively, then the 
function 4, defined by A(x,y) = f(x)g(y), is an integrable function on X K Y 
and 


frau x») = [fdu- fad 


(7) Suppose that n(X) = »(Y) = 1 and that 4p and Bo are measurable sub- 
sets of X and Y respectively such that 4(4o) = v(Bo) = 4. Let x be the char- 
acteristic function of (49 X Y) A(X X Bo) and write f(*,v) = 2x(,y). If, 
for every measurable set E in X X Y, 


X(E) = S flx,y)dd(x,9); 


then A is a finite measure on S X T with the property that \(4 X Y) = u(4) 
and A(X X B) = v(B), whenever 4eS and Be T. In other words, the product 
measure J is not uniquely determined by its values on such special rectangles. 

(8) The existence of the product measure is often proved by the following 
direct but combinatorially somewhat complicated method. The class of all 
finite, disjoint unions of measurable rectangles is a ring R, (33.E); if 


Ur. (4; X B;) and Un: (C; X Dj) 
are two representations of the same set in R, then since 


U?-1 U1 104 NC) x (B: ND) 
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is another representation of the same set, we have 
Dot=1 H(A) (By) = Lf w(C;)-v(Dj). 
In other words, a set function A is unambiguously defined on R by 
MUPa1 (4: X Bi) = Litas w(A4i) (Bi). 


It can be shown (essentially by proving a weakened form of Fubini’s theorem 
for the sets of R) that A is a measure to which the extension theorem (13.A) 
may be applied. 
(9) If 4 and B are arbitrary (not necessarily measurable) subsets of X and Y 
respectively, then 
\*(4 X B) = w*(A)-v*(B). 


(Hint: if 4* and B* are measurable covers of 4 and B respectively, then the 
relation 4 X BC A* X B* implies that \*(4 X B) S u*(4)-v*(B). The re- 
verse inequality may be proved by considering a measurable cover E* of A X B. 
Since E* N (A* X B*) is also a measurable cover of 4 X B, it is permissible to 
assume that E* C 4* X B*. It follows from Fubini’s theorem that 


MES) =f vEs"\du(x) = w(4*)-v*(B).) 


§37. FINITE DIMENSIONAL PRODUCT SPACES 


In the preceding sections we have developed the theory of 
product spaces for two factors; our next task is to investigate 
how this theory may be extended to any finite number of factors. 
We suppose that 7 (>1) is a positive integer, and that X1, ---, Xn 
are sets; we define the Cartesian product of these sets to be the 
set of all ordered 7-tuples of the form (x1, -++, *n), where x; © Xi, 
i=1,---,7. We shall denote this Cartesian product by 


Xy Ke XK Xn or Xie Xe or XK {Xi =1,---, nh. 


If A; is any subset of X;, i= 1, ---, m, the set Kie1 4; is a 
rectangle. 

It is worth while to ask about Cartesian product, as about 
every algebraic operation, whether or not it is associative. If, 
for instance, X, X2, and X3 are three sets, then, without changing 
the order in which they are presented, we may form the three new 
sets (X; X Xo) X X3, X1 X (X2 X X3), and X1 X Xo K X3. In 
what sense may we consider these three Cartesian products to 
be equal? Clearly they do not consist of the same elements; 
it is incorrect to confuse the ordered pair ((x1,%2),%3), whose first 
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element is itself an ordered pair, with the ordered triple (x ,%2,x3). 
Just as clearly, however, there is a “natural” one to one corre- 
spondence between any two of the three Cartesian products 
under discussion, namely the one which makes the points 


((X15%2)%3)y  (X1,(%25%3)), and  (x1,%2,%3) 


correspond to each other. Since it will turn out that this corre- 
spondence preserves all those structural properties of product 
spaces which are of interest to us, we shall wilfully fall into the 
trap we just pointed out and we shall consistently treat the three 
products described above as identical. We shall carry this identifi- 
cation procedure to its logical conclusion, and in the case, for 
instance, of seven factors, we shall consider the element 


(((r1s%2) %3)> ((4s%5), (%65%7))) 


of the set ((X; X X2) X X3) X ((X4 X Xs) X (Xe X X7)) to be 
the same as the element 


(x4 yX29%35%4 5X5, 6,7) 


of the set X, XK Xo X X3 XK X4 x X5 X Xe X Xz. 

The identification just described simplifies the language of 
many proofs. Since, for instance, we may view X; X---X X, 
as a repeated product 


(--+((X1 X Xe) X X3) K+++) X Xn 


of two factors at a time, we may prove the analogs of the theorems 
of §33 by mathematical induction on m. Some slight care has to 
be exercised in the formulation of the results. The correct version 
of the generalization of 33.D, for example, is the assertion that if 


E= Xi14:, F= XB, and G= Xi1C; 


are non empty rectangles, then E is the disjoint union of F and 
G if and only if there is aj, 1 <7 S n, such that 4; is the dis- 
joint union of B; and C; and such that 


4; = B; = C,;, for if. 
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The concept of a section (of a set or a function) also requires a 
minor modification; an X;-section, determined by a point x; 
in X;, of a set in X72, X; is a subset of 


X{[Xel sign ixj}. 


If (X;,S;), i = 1, ---, 2, are measurable spaces, we shall denote 


by 
S.xX-:-xS,, or X18, or K{S:i=1,---, 2} 


the o-ring generated by the class of all those rectangles X?., 4; 
for which 4;¢S;, i= 1, ---, ”, and we define the Cartesian 
product of the given measurable spaces as the measurable space 
(Xy X+++ XK Xa, Si X-++X S,). It follows that every section 
of a measurable set [or a measurable function] is a measurable 
set [or a measurable function]. Proceeding by mathematical 
induction, it is now trivial to define the Cartesian product of 
o—finite measure spaces (X;,S,,u;), i = 1, ---, m; there is one and 
only one measure uw (denoted by w, X---X wa) on Sy X-+-X Sh 
such that 


ulA, Xk Ax) a IIi-: Bi(A)) 


for every measurable rectangle 4, X---X 4. The extension of 
Fubini’s theorem is also immediate, so that the integral of any 
integrable function in a product space may be evaluated by form- 
ing the iterated integral in any order. 

It is customary to refer to a product space X = X7%., X; as 
n-dimensional. This terminology is not meant to define dimen- 
sion, nor to assert that m—dimensionality is an intrinsic structural 
property of a space; it serves merely to remind us of the way in 
which X was built from the components X;. A measure space 
might appear as three dimensional in one context and two di- 
mensional in another; if, for instance, n = 3, then we may view 
X as X; X Xp X Xz or as Xo X Xz, (where Xo = Xy X Xp). 


In (1)-(5) below we shall assume that X; is the real line, S; is the class of al: 
Borel sets, and yu; is Lebesgue measure, # = 1, --+, 7; we write 


(X,S,u) = Xoat (X;,Sihs). 
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(1) Sets of the o-ring S are called the Borel sets of n-dimensional Euclidean 
space. The class of all Borel sets coincides with the o-ring generated by the 
class of all open sets. 

(2) If @ is a Borel measurable function on X, and if fi, +++, fn are real valued, 
measurable functions on a measurable space (Y,T) for which YeT, then the 
function f, defined by f(y) = o(/i(y), -*+,fn(y)), is a measurable function on 
Y; (cf. 19.B). 

(3) The completed measure ff is called n-dimensional Lebesgue measure; 
most of the results of §§ 15 and 16 are valid for @. If, in particular, U and C 
are the class of all open sets and the class of all closed sets respectively, then, for 
every set E in X, 


p*(E) = inf {u(U): EC UeU} and ps(E) = sup {u(C): ED CeC}. 
(4) If T is any linear transformation defined by 
T(x, ++ Hn) = (My oo Inds We = Dhar aii thy FH 1,0 % 
then, for every set E in X, 
u*(T(E)) =|A|-w*(E) and ys(T(E)) = |A|-ne(E), 


where A is the determinant of the matrix (a:;). (Hint: it is sufficient to prove 
the assertion for measurable rectangles E whose sides are intervals. Treat first 
the following special cases. 

(4a) 9, =e +O,f =1, +--+, 

(4b) y: = xi if i ~ j and t ¥ ky yj = x, and yx = xj. 

(4c) ys = x: if § A i393 = xy te xn, where k ¥ j. 

(4d) Ji = My if i ¥ 3 Ji = CX;. 


The general case follows from the fact that JT may be written as the product 
of transformations of the types (4a)-(4d).) 
(5) The function ¢; on X, defined by 


Oi 0+, Hn) =H Hy JH 1 ---y, 
is measurable. 

(6) There is a way of defining n-dimensional Lebesgue measure which does 
not make use of the general theory of product spaces. To indicate this method, 
we shall consider the space X1 X-+-X Xn, where X; = X = the unit interval. 
For every x in X, let « = .ayaea3--+ be a binary expansion of « and write 


Hi = OjOnzidangict', F= lees. 


(For each x which has two binary expansions, select a definite one of them, say for 
instance the terminating one.) The transformation T from X to X; X-+-X Xn, 
defined by T(x) = (*1, +--+, *n), has the property that ifaset Ein Xy X---K Xn 
is measurable, then T~'(E) = {x: T(x) © E} is a measurable subset of X. (For 
the proof, consider the case in which E is a rectangle whose sides are intervals 
with binary rational end points.) The equation (41 X°°> X un) (EZ) = w(T-(E)) 
(where » is Lebesgue measure in X) may be used as the definition of the product 
measure ft) X--- X pn; this definition is consistent with our earlier one. 
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(7) By the familiar zig-zag diagonal process, i.e. by writing 
XL = 0020407" ++, 
H2 = .Ag@pQrgQj2° ++, 
M3 = .gQigQj3Qig***, 
M4 = 10191401 190l25°* *, 


the procedure of (6) may be extended to yield a definition of product measure 
in an “infinite dimensional” analog of Euclidean space. 


§ 38. INFINITE DIMENSIONAL PRODUCT SPACES 


The first step of an extension of product space theory to 
infinitely many dimensions suggests itself naturally. If {X,} is 
a sequence of sets, the Cartesian product 


X= Xin X; 
is defined as the set of all sequences of the form (x1, x2, -:+) where 
xi eX; i= 1, 2, ---. If each X; is a measure space, with a 


o-ring S,; of measurable sets and a measure yj, it 1s not quite 
clear, however, how the concepts of measurability and measure 
should be defined in X. In this section we shall show how this 
may be done, under the assumption that the spaces X; are totally 
finite measure spaces such that u,(X;) = 1,7 = 1,2, ---. We 
observe that the measure on every totally finite measure space 
(X,S,u), for which w(X) ¥ 0, may be trivially altered (by divid- 
ing the measure of every measurable set by »(X)) so that the 
measure of the entire space is 1. We shall see, however, that 
since the number 1 plays a distinguished role in the formation of 
products (particularly of infinite products), the condition p;(X;) 
= 1 is not merely an inessential normalization. 

Suppose then that, for each i = 1, 2, ---, X; is a set, S; is a 
o—algebra of subsets of X;, and u; is a measure on S; such that 
ui(X;) = 1. In this case we define a rectangle as a set of the 
form Xj, 4;, where 4; C X; for all i and 4; = X; for all but a 
finite number of values of 7. We define a measurable rectangle 
as a rectangle X;-1 4; for which each 4; is a measurable subset 
of X;; in view of the preceding definition, this condition is a 
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restriction on only a finite number of the 4’s. A subset of 
Xin1 X; will be called measurable if it belongs to the o-ring S 
(which is in fact a o—algebra) generated by the class of all measur- 
able rectangles; we shall writeS = X71 Si. 

Suppose that J is any subset of the set J of all positive integers; 
we shall say that two points 


x = (x1, %,---) and y = (1, y2, ---) 


agree on J, in symbols x = y (J), if x; = y; for every 7 in /. 
A set E in X is called a J-cylinder if x = y (J) implies that x and 
y belong or do not belong to E simultaneously. In other words, 
E is a J-cylinder if altering those coordinates of any point whose 
index is not in J cannot remove the point from £, nor insert it 
into E if it was not already there (cf. 6.5d). If, for instance, 
J = {1, «++, 2} and 4; is an arbitrary subset of Xj,7 = 1, ---, 2, 
then the rectangle 4; X---X An X Xngi X Xnge X°*: is a 
J-cylinder. 
We shall write 


XM = ». Cane. € n= 0, 1,2, rey 


in view of our identification convention for product spaces, we 
may write X = Ki, Xi; = (Xi X---X Xn) X XM. Since each 
space X™) is an infinite dimensional product space such as 
X(= X), the considerations applied (above and in the sequel) 
to X, may also be applied to X. For every point (x, --+, xn) 
in X,; X---X X, and every set E in X, we shall denote by 
E(x, +++, %n) the section of E (in X) determined by (x1, «++, a). 
We observe that every such section of a [measurable] rectangle 
in X is a [measurable] rectangle in X™. 


Theorem A. Jf J = {1,---, 1} and if a subset E of X is a 
[measurable] J-cylinder, then E = AX X™, where A is a 
[measurable] subset of Xi X--+-X Xn. 


Proof. Let (%n41, ¥n42, °-+) be an arbitrary point of X™ and 
let 4 be the X-section (in X, X---X Xn) of E, determined by 
this point. Since both the sets E and 4 X X“ are J—cylinders, 
it follows that if a point (#1, ¥2, ---) of X belongs to either of them, 
then so does the point (*1, +++, %ny ¥n41) Xanga, °°*). It is clear, 
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however, that if a point of this latter form belongs to either one 
of the sets E and 4 X X™, then it belongs also to the other. 
Using once more the fact that both these sets are J-cylinders, 
and hence that if (x1, +++, ¥n, ¥n41) ¥n42, °**) belongs to either 
of them, then so does («1 «++, Xn, %n+41,) Xn42) °**), we conclude 
that E and 4 X X™ consist of the same points. The fact that 
the measurability of EZ implies that of 4 follows from 34.4. 

If m and n are positive integers, m <n, then it may happen 
that a non empty subset E of X is simultaneously a {1, ---, m}- 
cylinder and a {1, ---, #}-cylinder. By Theorem A we conclude 
that 

E=AxXX™ and E=BX xX, 


where 4c X, X:-:X Xm and BCX, X::-XK Xn. Since we 
may rewrite the first of these relations in the form 


E= (AX Xm Ki Xn) x x. 


it follows from 33.C that B= 4 X Xmui X++*>X Xn. Conse- 
quently if E is measurable, so that both 4 and B are measurable, 
then 

(ur X +++ X om)(A) = (ur X +++ X Mn) (B). 


It follows that a set function » is unambiguously defined for 
every measurable {1, ---, 2}-cylinder 4 X X™ by the equation 


w(A X X™) = (uy X +++ X pn) (A). 


We shall denote the domain of definition of y, 1.e. the class of all 
measurable sets which are {1, ---, #}—cylinders for some value 
of n, by F; the sets of F may be referred to as the finite dimen- 
sional subsets of X. It is easy to verify that F is an algebra, 
that S(F) = S, and that the set function » on F is finite, non 
negative, and finitely additive. 

We shall denote the analogs of F and uw in the space X™, 
n=1,2,---, by F™ and w™ respectively. It follows from our 
results for finite dimensional product spaces that if E belongs tc 
F, then every section of the form E(m, ---, xn) belongs to F™, 
and 


n(E) =[- fu En, +++, Xn))dui (x1) +++ dun(xn). 
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Theorem B. Jf {(X;,S,,u:)} is @ sequence of totally finite 
measure spaces with u{X;) = 1, then there exists a unique 
measure wu on the o-algebra S = K;-18; with the property 
that, for every measurable set E of the form A X X™, 


w(E) = (ur X++*X wn) (A). 


The measure yu is called the product of the given measures 
Mi, M = Xie1 wi; the measure space 


(Xm Xiy Xint Si, Ximt 7) 


is the Cartesian product of the given measure spaces. 

Proof. In view of 9.F and 13.A, all we have to prove is that 
the set function » on the algebra F of all finite dimensional measur- 
able sets is continuous from above at 0, ie. that if {E,} is a 
decreasing sequence of sets in F such that 0 < e S$ u(E;), 7 = 1, 
2, +++, then ()1 E; <0. 


If F; = {a2 bp (E;(*1)) > ‘| , then it follows from the relation 
ws) = fu Eo) duno) = 


= fo Este ))dualor) +f ws) di (en) 
F; F; 
that w(E;) S m(F;) + ; , and therefore that 
m(Fj) = . 


Since {F;} is a decreasing sequence of measurable subsets of X;, 
and since yw, (being countably additive) is continuous from above 
at 0, it follows that there exists at least one point %, in X; such 


that hp (E;(%)) 2 aa = 1,2,--++. Since {£,(%)} is a decreas- 


ing sequence of measurable subsets of X, the argument just 
applied to X, {E;}, and « may be repeated for X, {E;(%,)}, 


and 5 . We obtain a point %2 in Xe such that p(E;(%, %)) = z ; 
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j = 1,2, --+. Continuing in this manner, we obtain a sequence 
{%1, Xe, ---} such that %, ¢ Xn, 2 = 1,2, ---, and 


w(E, By ++) Be) Bee J 

The point (%,, %:, ---) belongs to (\j_, E;. To prove this 
assertion, we consider any particular E; and we select the positive 
integer 7 so that E; is a {1, ---, m}-cylinder. The fact that 
uw (E;(%A, +++, ¥n)) > O implies that EZ; contains at least one 
point (%, ¥2, +++) such that x; = %; fori = 1, ---, . The fact 
that E; is a {1, ---, 2}-cylinder implies then that (%, Xa, -- -) itself 
belongs to E;. § 


(1) It is not essential for the results of this section that the index set J is 
the set of positive integers; any countably infinite set may be used for J. (The 
space X = XK {X;:ie/} consists, by definition, of all functions x defined on J 
and such that their value «(#) at each index 7 is a point of X;.) The proof of this 
assertion may be carried out by an enumeration of J, i.e. by establishing an 
arbitrary but fixed one to one correspondence between the given set J and the 
set of positive integers. The case, for instance, in which J is the set of a// integers 
has many applications. 

(2) The generalization of product space theory to uncountably many factors 
is surprisingly easy. If J is an arbitrary index set, and if, for each 7 in J, 
(X;,S,,us) is a totally finite measure space with u;(X;) = 1, then we may define 
X = X {Xi ie J} asin (1), and the concepts of rectangle, measurable rectangle, 
and measurable set verbatim as in the countable case. Since the class of all 
those sets which are J-cylinders for a countable subset J of I is a o-algebra 
containing all measurable rectangles, it follows that each measurable set E is a 
J-cylinder for a suitable J. If u(Z) is defined to be (Xjey uj) (E), then pu is a 
measure on the class of all measurable sets and uw has the product property which 
justifies its being denoted by Xie 1b. 

(3) It is trivial to combine the theories of finite and infinite dimensional 
product spaces and thus to produce a theory of product spaces in which a finite 
number of the factors is not required to be a totally finite measure space but 
allowed to be o-finite. 

(4) If X = Xi, X; is a product space such as the one described in Theorem 
B, and if, for each #, E; is a measurable set in X;, then E = X¢.1 E; is a measur- 


able set in X and 
u(E) = Tit 1 ui(E:) = lim, TIt-: ui(E). 


(Hint: if Fx = Ey X---X Ex X X™, then {F,} is a decreasing sequence of 
measurable sets in X such that 


sei Fa = XP. A: and w(F,) = []f-1 wi(E).) 
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(5) It is possible to use the theory of product spaces to give a completely non 
topological construction of Lebesgue measure on the real line (cf. the proof of 
8.C), and hence on n-dimensional Euclidean space (cf. 37.6). To obtain such a 
construction let (X0,So,u0) be the measure space whose points are the two real 
numbers 0 and 1, with So = the class of all subsets of Xo, and uo({O}) = po({1}) 
= 4. Foreaché = 1,2, ---, write (X;,S;,ui) = (Xo,So,u0), and form the product 


space 
(X,S,u) = (Xi 1X, Keer S:, Xr 1 Hi). 


(5a) For each point x = (x1, *2, «-+) in X, the set {x} is measurable and 
u({x}) = 0. (Hint: cf. (4).) 

(5b) The set E of all points « = (x1, *2, --+) in X for which x; = 1 for all 
but a finite number of values of # is measurable and has measure zero. (Hint: 
E is countable.) We shall write. X = X-E, and, in what follows, we shall 
consider the measure space (X,5,f), where § = SN ¥ and REN X) = 
u(E), ES. 7 

(5c) If for each x = (x1, x2, ++) in X we write 2(x) = Df. «:/2', then the 
function z establishes a one to one correspondence between X and the interval 

= {2:0 Sz <1}. (Hint: consider the binary expansion of each z in Z with 
the agreement that, if the expansion is not unique, the terminating expansion 
is selected in preference to the infinite one.) 

(5d) If d= {z: OS aSz<bS 1}, and E= {x: 2(x)e A}, then E is 
measurable and @(E) = 4— a4. (Hint: it is sufficient to consider the case in 
which a and 4 are binary rational numbers.) 

(Se) If A is any Borel set in Z and E = {x: 2(x) & A}, then E is measurable 
and @(E) is equal to the Lebesgue measure of 4. (Hint: the set function », 
defined by »(4) = fi(E), is a measure which coincides with Lebesgue measure on 
intervals.) 

The considerations of (5a)-(Se) serve to construct Lebesgue measure on the 
interval Z. Lebesgue measure on the entire real line may be obtained by con- 
sidering the line as a countable, disjoint union of such intervals. Alternatively 
we may consider the space J of all integers (with the class of all subsets of J 
playing the role of the class of measurable sets and the measure of a set defined 
to be the number of its points), and observe the existence of an obvious one to 
one correspondence between the real line and the product space I X Z. 

(6) A construction similar to the one in (5) may be obtained by considering 
the space (X0,So,40), where Xo is the set of all positive integers, So is the class of 
all subsets of Xo, and wo(E) = >-1e227*. We form as before the product space 
X = Xf.1 Xi, whose points this time are sequences of positive integers. For 
each x = (x1, x2, +++) in X we write 


Gy = We Dart, 


By the consideration of binary expansions it may be proved that the conclusions 
of (5c), (Sd), and (5e) are valid for this z. 

(7) Suppose that Xo = {x: 0 S xo < 1} is the semiclosed unit interval; let 
So be the class of all Borel sets in Xo and let uo be Lebesgue measure on So. 


We write 
(XiSime) = (Xo,Somo), # = 1,2, ---, 
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and we form the product space X = Xf.1.X;. There exists a one to one corre- 
spondence between X and Xo such that every Borel set in Xo corresponds to a 
measurable set (i.e. to a set belonging to X¢u1S;) in X, and such that corre- 
sponding sets have equal measures. (Hint: if Yo is the two-point space described 
in (5) and denoted there by Xo, and if Yi; = Yo fori = 0, 1,2, --- andj = 1, 
2, +++, then X; = XP1 Yu, i = 0, 1, 2, «++. The correspondence is based on 
the usual correspondence between doubly infinite sequences, i.e. elements of 
X= X01X%i = Kh KPa Ys, and simple sequences, i.e. elements of 
Xo = Xfn1 Yo;-) 


Chapter VIIT 


TRANSFORMATIONS AND FUNCTIONS 


§39. MEASURABLE TRANSFORMATIONS 


In every mathematical system it is of interest to investigate 
the transformations that leave some or all structural properties 
of the system invariant. While it is not our intention to study in 
great detail the transformations that occur in measure theory, we 
shall in this section discuss some of their fundamental properties. 

A transformation is a function T defined for every point of a 
set X and taking values in a set Y. The set X is called the 
domain of 7; the set of those points of Y which are of the form 
T(x) for some x in X is the range of T. A transformation whose 
domain is X and whose range is in Y is often described as a 
transformation from X into Y; if the range of T is Y, T is called a 
transformation from X onto Y. For every subset E of X, the 
image of E under 7, in symbols T(E), is the range of the trans- 
formation T from E into Y; for every subset F of Y, the inverse 
image of F under 7, in symbols T~1(F), is defined to be the 
set of all those points of X whose image is in F; i.e. 


TO(F) = {x: T(x) © F}. 


A transformation T is one to one if T(x,) = T(x) occurs when 
and only when x, = *2. The inverse of a one to one transforma- 
tion T, denoted by T™, is the transformation which is defined 
for every y = T(x) in the range of T by T7'(y) = x. 

If T is a transformation from X into Y and S is a transformation 
from Y into Z, the product of S and T, in symbols ST, is the 
transformation from X into Z defined by (ST)(x) = S(T(x)). 

161 
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A transformation T from X into Y assigns in an obvious way 
a function f on X to every function g on Y;/f is defined by f(x) = 
g(7T(x)). It is convenient and natural to write f = gT. 


Theorem A. If T is a transformation from X into Y, if 
gis a function on Y, and if M is any subset of the space in which 
the values of g lie, then 


{xz (gT)(x) eM} = T({ y:g(y) © M}). 


Proof. The following statements are mutually equivalent: (a) 
xo e {x: (gT)(x) « M}, (b) g(T(xo)) © M, (c) if yo = T(xo), then 
g(yo) eM, and (d) T(xo) e fy: g(y) eM}. The equivalence of 
the first and last ones of these statements is exactly the assertion 
of the theorem. J 

If (X,S) and (Y,T) are measurable spaces and if T is a trans- 
formation from X into Y, how should the concept of measurability 
be defined for T? Motivated by the special case in which Y 
is the real line, we shall say that T is a measurable transformation 
if the inverse image of every measurable set is measurable. We 
observe that this language is inconsistent with our earlier one 
concerning measurable functions; because of the special role of 
the real number 0, a measurable function is not necessarily a 
measurable transformation. This slight inconsistency is amply 
repaid by convenience in applications; confusion can always be 
avoided by use of the proper one of the terms “function” and 
“transformation.” In the important case in which X itself be- 
longs to S and Y is the real line, the two concepts, measurable 
transformation and measurable function, coincide. 

If T is a measurable transformation from (X,S) into (Y,T), 
we shall denote by T~!(T) the class of all those subsets of X 
which have the form 7~1(F) for some F in T; it is clear that 
T—(T) is a o-ring contained in S. 


Theorem B. Jf T is a measurable transformation from 
(X,S) into (Y,T), and if g is an extended real valued measurable 
function on Y, then gT is measurable with respect to the o-ring 
Tt (); 


Proof. Theorem A implies that, for every Borel set M on the 
real line, 
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N(gT) N (gT)~\(M) = {x: (gT)(x) eM — {0}} = 
= T({y: g(y) eM — {0}}) = T"(N(g) N g(M)); 

it follows from the measurability of T that the set on the left 
belongs to T“'(T). Jj 

A measurable transformation T from (X,S) into (Y,T) assigns 
in an obvious way a set function »v on T to every set function 
yon S; vis defined for every F in T by o(F) = n(T(F)). It is 
convenient and natural to write vy = pT. 


Theorem C. Jf T is a measurable transformation from a 
measure space (X,S,mu) into a measurable space (Y,T), and if g 
is an extended real valued measurable function on Y, then 


fedur-) = (eras, 


in the sense that if either integral exists, then so does the other 
and the two are equal. 


Proof. It is sufficient to treat non negative functions g. If g 
is the characteristic function of a measurable set F in Y, then it 
follows from Theorem A that g7 is the characteristic function 
of T~'(F) and therefore 


fedut>) = GT) = TE) = fea. 


{t follows from this relation that the asserted equality is valid 
whenever g is a simple function. In the general case let { g,} be 
an increasing sequence of simple functions converging to g; then 
{gnT} is an increasing sequence of simple functions converging 
to gT and the desired conclusion follows by taking limits. J 

If, in the notation of Theorem C, F is a measurable subset of 
Y, then an application of Theorem C to the function yrg yields 
the relation 


Jso4ur>) =f, eT) dul). 


We observe that either side of this equation may be obtained from 
the other by the formal substitution y = T(x). 
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Theorem D. If T is a measurable transformation from a 
measure space (X,S,u) into a totally o-finite measure space 
(Y,T,v), such that »5T~ is absolutely continuous with respect 
to v, then there exists a non negative measurable function o on 
Y such that 


fecrerdu(a) = fano(rdao9) 


Sor every measurable function g, in the sense that if either integral 
exists, then so does the other and the two are equal. 


The function ¢ plays the role of the Jacobian (or, rather, the 
absolute value of the Jacobian) in the theory of transformations 


of multiple integrals. 


d(uT 
Proof. Write ¢ = a , (cf. §32), and apply 32.B to the 
y 


result of Theorem C. J 


If T is a one to one transformation from a measurable space 
(X,S) onto a measurable space (Y,T), and if both T and T7 
are measurable, we shall say that T is measurability preserving. 
A measurability preserving transformation T from a measure 
space (X,S,u) onto a measure space (Y,T,v) is measure preserving 
if wl! = pv, 


(1) The product of two measurable transformations is measurable. 

(2) If T is a measurable transformation from (X,S) into (Y,T), and if a func- 
tion f on X is measurable with respect to T~'(T), then f(x1) = f(x2) whenever 
T(x) = T(xe). (Hint: if Fi is a measurable set in Y containing T(x1), then 
there exists a measurable set F in Y such that 


{x: f(x) = flm)} N TA) = THF). 


The fact that x,e T—'(F) implies that x2 e T-'(F).) 

(3) If T is a measurable transformation from (X,S) onto (Y,T), and if a real 
valued function f on X is measurable with respect to T—1(T), then there exists a 
unique measurable function g on Y such that f = gT. (Hint: in view of (2), g 
is unambiguously defined for every y = T(x) by g(y) =/f(x). The fact that we 
have, for every Bore! set M on the real line, 


T71({ y: g(y) © M}) = {x: f(x) e M}, 


implies, since T(X) = Y, that M(g) N {y:g(y)eM}eT.) Does this result re 
main true if T maps X into Y? 
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(4) Suppose that X = Y = the unit interval, S = the class of all Borel sets, 
and T = the class of all countable sets. If the transformation T is defined by 
T(x) = x, then T is a one to one, measurable transformation from X onto Y, 
but T is not measurability preserving. Is it possible to construct such an exam~- 
ple for which (X,S) = (Y,T)? 

(5) If T is a measurable transformation from (X,S) into (Y,T), and if u 
and v are two measures on S such that yp, then pT"! K pT. 


§ 40. MEASURE RINGS 


A Boolean ring is a ring in the usual algebraic sense, with the 
property that every element is idempotent. Equivalently, a 
Boolean ring is a set R and two algebraic operations (called addi- 
tion and multiplication) defined for pairs of elements of R, sub- 
ject to the following restrictions. (a) Both addition and multipli- 
cation are commutative and associative, and multiplication is 
distributive with respect to addition. (b) There exists in R a 
unique element (denoted by 0) such that the result of adding 0 
to any element E is E. (c) The result of adding any element to 
itself is 0. (d) The result of multiplying any element E by itself 
is E. 

A typical example of a Boolean ring is a ring of subsets of a set 
X with EAF and Ef F playing the roles of the sum and the 
product of E and F, respectively. Since our introduction of 
Boolean rings is motivated exclusively by rings of sets, we shall 
adopt the mnemonic device of always denoting addition and 
multiplication in Boolean rings by A and N. 

Most of the concepts we introduced and results we established 
for rings of sets carry over without change to Boolean rings in 
general. If, in particular, the formation of unions and differences 
is defined by 

EUF=(EAP)A(ENFP) 
and 
E-F=EA(ENF), 


then these operations are subject to the same formal identities 
as the corresponding operations on sets. A similar statement is 
true about the inclusion relations E c F and E 3 F, defined by 


ENF=E and ENF=F 
respectively. 
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We recall that the union of any class of sets is the smallest set 
containing them all and their intersection is the largest set con- 
tained in them; similar statements are true about unions and inter- 
sections (as far as they can be formed) in every Boolean ring. 
If, for instance, E and F are elements of a Boolean ring R, then 
E U F is indeed the smallest element containing both E and F; 
ie ECEUF,FCE U F,and, if Gis an element of R for which 
EcG and FcG, then EUF CG. For an infinite set of 
elements in a Boolean ring, however, there need not be any ele- 
ment that contains them all, and, even if there is one, there need 
not be a smallest one. A Boolean o-ring is a Boolean ring S 
with the property that every countable set of elements in S has 
a union; it is easy to verify that every countable set of elements 
in a Boolean o-ring has an intersection. A typical example of a 
Boolean o-ring is, of course, a o-ring of subsets of a set X. 

A Boolean algebra is a Boolean ring R in which there exists an 
element different from 0 (which, for obvious reasons, we shall 
denote by X), with the property that E Cc X for every E in R. 
A Boolean o-algebra is a Boolean o-ring which is a Boolean 
algebra. 

The definitions of the concepts of additivity, measure, o-finite- 
ness, etc. for functions defined on a Boolean ring are the same 
as the corresponding definitions for set functions on a ring of sets. 
A measure » on a Boolean ring is positive if it vanishes for the 
zero element only. 

A measure uw on a o-ring S of subsets of a set X is usually 
not positive. There are, however, several well known procedures 
which have the effect of making a positive measure out of u. One 
such procedure is to consider the class N of measurable sets of 
measure zero and then, after observing that N is an ideal in the 
ring S (these words being used in their customary algebraic 
sense) to replace S by the quotient ring S/N. Another (equiva- 
lent) procedure is to write E ~ F whenever p(E AF) = 0 and 
then, after observing that the relation “~”’ is reflexive, sym- 
metric, and transitive, to replace S by the set of all equivalence 
classes with respect to the relation ~. 

The most usual and most convenient procedure in measure 
theory (which is the one we shall adopt) is still another one. 
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We shall not replace S by another system—the elements of the 
Boolean o-ring that we propose to consider are to be measurable 
sets. We shall, however, redefine the concept of equality; if 
two sets E and F in S are such that n(E A F) = 0, then we shall 
consider them equal and we shall write E = F [yp]. If Z, = Fp [ul, 
n=1,2,---, then 


Ei = Fi = Fa — F, and Ure E, = Ura F, [ul, 


so that even with the altered concept of equality, S is a Boolean 
o-ring with respect to the familiar set operations. If E = F [uy], 
then u(Z) = u(F), so that even with the altered concept of 
equality, the measure uw is unambiguously defined on S. Since 
the statements n(E) = 0 and E = 0 [y] are obviously equivalent, 
we see that, after the alteration of the concept of equality, p 
becomes a positive measure. 

If (X,S,u) is a measure space, we shall use the symbol S(uz) 
to denote the o-ring S with equality interpreted modulo y, as 
described above. 

A measure ring (S,u) is a Boolean o-ring S and a positive meas- 
ure » on S. The preceding considerations show that if (X,S,z) 
is a measure space, then (S(u),u) is a measure ring; we shall call 
it the measure ring associated with X or simply the measure ring 
of X. A measure algebra is a Boolean algebra which is at the 
same time a measure ring. The phrases [totally] finite and 
o—finite are used for measure rings and measure algebras in the 
same way as for measure spaces. 

An isomorphism between two measure rings (S,u) and (T,») 
is a one to one transformation T from S onto T such that 


T(E — F) = T(E) -— TF), Trai En) = Unei TEs), 


and 
B(E) = o(T(E)), 


whenever E, F, and E, are elements of S, » = 1, 2, ---. Two 
measure rings are isomorphic if there exists an isomorphism be- 
tween them. Two measure spaces (X,S,u) and (Y,T,v) are 
isomorphic if their associated measure rings (S(x),u) and (T(v),») 
are isomorphic. 
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An atom of a measure ring (S,u) [or of a measure y] is an ele- 
ment E different from 0 such that if F c £, then either F = 0 
or F = E; a measure ring with no atoms is non atomic. If 
(X,S,u) is a measure space whose measure ring is non atomic, 
then both the measure space X and the measure yu are called non 
atomic. 

If (S,u) is a measure ring, we shall denote by 8 [or S(u)] the set 
of elements of finite measure in S and, for any two elements E 
and F in 8, we shall write 


(EF) = w(E A F). 


It is easy to verify that the function p is a metric for 8; we shall 
call § the metric space associated with (S,u), or, simply, the 
metric space of (S,u). We shall also use the symbol S(u) for the 
metric space associated with the measure ring (S(x),u) of a measure 
space (X,S,u). A measure ring or a measure space is called 
separable if the associated metric space is separable. 


Theorem A. Jf 8 is the metric space of a measure ring (S,n), 
and if 


f(EF) =EUF and g(E,F)=ENF, 


then f, g, and also p, are all uniformly continuous functions of 

their arguments. 

Proof. The desired results are immediate consequences of the 
relations 
w((Ey U Fy) — (Ey U Fe)) + w((F2 U Fe) — (Ai U ae z 
B((E, N Fi) — (E2 F2)) + B( (Es N Fe) — (Fi n F;)) — 
B(E, — Ex) + w(Pi — Fo) + w(E2 — Ei) + w(Fe — Fi) 


IA 


and 
[w(E) — w(F)| =|n(E- F) -n(F-2£)| Ss 
<pw(E-F)+u(F-E£). J 


Theorem B. Jf (X,S,u) is a o-finite measure space such 
that the o-ring S has a countable set of generators, then the metric 
space 8(u) of measurable sets of finite measure is separable. 
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Proof. Let {E,} be a sequence of sets in S such that S = 
S({E,}). Because of the o-finiteness of u, there is no loss of 
generality in assuming that u(E,) < © for every m = 1, 2, 
Since (5.C) the ring generated by {£,} is also countable, we 
may assume that the class {E,: 2 =1, 2, ---} is a ring. It 
follows from 13.D that, for every E in 8(u) and for every positive 
number e, there exists a positive integer 7 such that p(£,E,) < «. 
Since this means that a countable set is dense in 8(u), the proof 
of the theorem is complete. J 


(1) The metric space § of a measure space (X,S,u) is complete. (Hint: if 
{Z,} is a fundamental sequence in $, and if xn is the characteristic function of 
E,, n = 1,2, +++, then {xn} is fundamental in measure and therefore 22.E may 
be applied.) 

(2) Is the metric space of a measure ring complete? 

(3) There is a concept of completeness for Boolean rings which is related to 
but not identical with the concept of the same name for metric spaces. A Boolean 
ring R is complete if every subset E of R has a union. Clearly every complete 
Boolean ring is a Boolean o-algebra; in the converse direction it is true that 
every totally finite measure algebra is complete. (Hint: let E be the set of all 
finite unions of elements of E. Write a = sup {u(Z): Ee}, find a sequence 
{E,} of elements of E such that lim, w(Z,) = a, and set E = Use: E,.) 

(4) The result of (3) remains true for totally o-finite measure algebras. 

(5) If p is the metric on the metric space S of a measure ring (S,u), then p 
is translation invariant in the sense that p(E AG, FAG) = p(£,F) whenever 
E, F, and G are in 8. 

(6) If a one to one transformation T from a measure ring (S,u) onto a measure 
ring (T,v) is such that T(E — F) = T(E) — T(F), T(E U F) = T(E) U TIF), 
and u(E) = »(T(E)), whenever E& and F are in S, then T is an isomorphism. 

(7) If a one to one transformation T from a measure ring (S,u) onto a measure 
ring (T,v) is such that w(Z) = »(T(E£)) and E C Fif and only if T(E) C T(F), 
then T is an isomorphism. 

(8) A metric space $ with metric p is convex if, for any two distinct elements 
E and F in 8, there exists an element G, different from both E and F and such 


that 
P(E,F) = p(E,G) + p(G,F). 


The metric space of a o-finite measure ring is convex if and only if the measure 
ring is non atomic. 

(9) An isomorphism between two measure rings is an isometry between their 
metric spaces. 

(10) A totally o-finite measure ring has (at most) countably many atoms. 

(11) If 8 is the metric space of a measure space (X,S,u) and if pv is a finite 
measure on S such that »“zy, then the function » is unambiguously defined 
and continuous on 8. 

(12) If (X,S,u) is a o-finite measure space and {v,} is a sequence of finite 
signed measures on S such that each p, is absolutely continuous with respect to 
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pw and such that lim, »,(£) exists and is finite for every E in S, then the set 
functions », are uniformly absolutely continuous with respect to yw. (Hint: 
let 8 be the metric space of (X,S,u) and write, for each fixed positive number ¢, 


& = Nauk mene {z: Ee8, | vn(E) — vm(E)| S 3} : 


Since, by (11), each & is closed, and since, by (1), 8 is a complete metric space, 
the Baire category theorem implies that there exists a positive integer ko, a 
positive number ro, and a set Ep in § such that {E: p(E,Eo) < ro} © &,. Let 


5 be a positive number such that 6 < 7) and such that | va(Z)| < ; whenever 
u(E) <6 and n = 1, ---, &. Observe that if u(Z) < 6, then 
p(Ey — E,Eo) < ro and p(Eo U E,Eo) < To, 


and 
|»(E)| Ss 
s | Vx(E) | + | vn(Eo U E) a, Ve (Eo U E) + | vn(Eo = E) pas Veg(Eo ar E) |.) 


(13) If, in the notation of (12), »(Z) = lim, v,(Z), then vy is a finite signed 
measure andy << p. 

(14) If {y,} is a sequence of finite signed measures such that lima va(E) = 
v(E) exists and is finite for every measurable set £, then v(£) is a signed measure. 


(Hint: if | vn(E)| S cn, 2 = 1, 2, ---, write w(Z) = er ele and 


2"Cn 
apply (13).) 

(15a) Every Boolean ring R is isomorphic (in the customary algebraic sense 
of that word) to a ring of subsets of some set X. (Hint: consider the Boolean 
algebra Ro of two elements 0 and 1, and let X be the set of all homomorphisms 
of Rinto Ro. If, for every E in R, 


T(E) = {x: xe X, x(E) = 1}, 


then T is a homomorphism from R into the algebra of all subsets of X; all that 
remains to be proved is that if He R and E * 0, then there exists an x in X for 
which x(E) = 1. If R is finite, this result is easy. In the general case let X* 
be the set of all functions from R into Ro; in the customary product topology 
X* is a compact Hausdorff space. If R is any finite subring of R such that 
EeR, and if X*(R) is the set of all those functions x* in X* which are homo- 
morphisms on R and for which x*(Z) = 1, then the relation 


Ni-1 X*(Ri) D X*R) 


(where R is the ring generated by Ri, ---, Rn) shows that the class {X*(R)} of 
sets has the finite intersection property.) This result is known as Stone’s 
theorem. 

(15b) The proof of Stone’s theorem, outlined above, shows that R is isomorphic 
to a ring of open-closed sets in a compact Hausdorff space. If R is a Boolean 
algebra, then R is isomorphic to the ring of all open-closed sets in a compact 
Hausdorff space. (Hint: changing the notation of (15a) slightly, let X be the 
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set of all those homomorphisms of R into Ro which map the maximal element of 
Ron 1. Then the image of R under T contains a base for the topology of X. 
If a class of open-closed subsets of a compact Hausdorff space is a base and is 
closed under the formation of finite unions, then it contains every open-closed 
set.) 

(15c) Every Boolean o—algebra S is isomorphic to a o—algebra of subsets of 
some set X modulo a o-ideal. (Hint: map S by an algebraic isomorphism T 
on the algebra of all open-closed sets in a compact Hausdorff space X; let So 
be the o-ring generated by the class of all open-closed subsets of X and let Ng 
be the class of all sets of the first category in So. If {E,} is a sequence of open- 
closed sets, write E = T(r; T~(En)); it follows that E — Ur.1 £, is 
nowhere dense. In other words, the class of all open-closed sets is closed, modulo 
No, under the formation of countable unions. The only essential thing that is 
still lacking is the fact, which ensures that T remains an isomorphism even after 
reduction modulo No, that no non empty open-closed set belongs to No; this 
result is, however, a special case of the Baire category theorem, which is just as 
valid for locally compact spaces as for complete metric spaces.) 


§41. THE ISOMORPHISM THEOREM 


The purpose of this section is to prove that the concept of a 
measure ring is not as general as it might appear. We shall show, 
in fact, that every measure ring, subject to certain not too restric- 
tive conditions, is the measure ring of a measure space. Of the 
many theorems of this type we shall discuss only a rather special 
one, which we selected because it is important both historically 
and in current applications. 

In what follows we restrict our attention to totally finite meas- 
ure algebras. If (S,u) is a totally finite measure algebra, then, 
unless we explicitly say otherwise, the symbol X will denote the 
maximal element of S; the algebra S and the measure yp are called 
normalized if u.(X) = 1. A partition of an element E of S is a 
finite set P of disjoint elements of S whose union is E. The 
norm of a partition P = {Fi, ---, Ex}, denoted by | P|, is the 
maximum of the numbers u(Fi), ---, u(Ey). If P = {E,---, Ex} 
is a partition of E and if F is any element of S contained in E, 
we shall write P N F for the partition {Z, NF, ---, Ex NF} 
of F. 

If P, and P, are partitions, we shall write P, < P, if each 
element of P; is contained in some element of P,; a sequence 
{P,} of partitions is decreasing if P,4, S P, form =1,2,---. 
A sequence {P,,} of partitions is dense if to every element E of S 
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and to every positive number e there corresponds a positive integer 
n and an element Ep of S which is equal to a union of elements of 
P,, and is such that p(Z,Eo) = w(E A Ey) < «. 


Theorem A. /f (S,u) is a totally finite, non atomic measure 
algebra, and if {P,,} is a dense, decreasing sequence of partitions 
of X, then lim, | P| = 0. 


Proof. Since {| P,|} is a decreasing sequence of positive 
numbers, it has a limit; we shall derive a contradiction from the 
assumption that this limit is a positive number 6. 

If P, = {Ky ---, Ey}, then at least one of the elements E; 
must be such that 


[P,N E;|26 for n=1,2,--:. 


Let F, be such an element and consider the sequence {P, N F;} 
of partitions of Fj. By a repetition of the argument just used 
we may find an element Fo of the partition P2 such that Fy Cc F; 
and 

[P,NF,| 26 for n=1,2,---, 


and we may proceed so on ad infinitum. 

If F = ()%.1 Fa, then w(F) = 6 > 0, and therefore, since F is 
not an atom, there exists an element Fy such that Py C F and 
0 < w(Fo) < u(F). We observe that the element Fo is either 
contained in or disjoint from every element of each of the parti- 
tions P,, nm = 1,2, ---. It follows that if € is a number smaller 
than either of the numbers p(Fo) and u(F) — u(Fo), then no 
element of S which is a union of such partition elements can have 
a distance less than e from Fy. Since this contradicts the density 
of {P,}, the proof of the theorem is complete. § 


Theorem B. Jf Y is the unit interval, T is the class of all 
Borel subsets of Y, and v is Lebesgue measure on T, and if 
{Qn} is a sequence of partitions into intervals of the maximal 
element Y of the measure algebra (T,v), such that limn| Qn | 
= 0, then {Qn} is dense. 


Proof. To every positive number e there corresponds a posi- 


tive integer 7 such that | Q,| < : . If Eis any subinterval of Y, 
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let E, be the uniquely determined interval of the partition Q, 
with the property that the left end point of EZ is contained in E,. 
If E, does not contain the right end point of E, let Ey be the inter- 
val of Q, which is adjacent to E£; on the right, and proceed so on a 
finite number of times until the process terminates with an 
interval E, of Q, which does contain the right end point of E. 
The union of the intervals Ey, ---, E, is at a distance less than e 
from E; this proves that every subinterval of Y may be approxi- 
mated by unions of elements of {Q,}. Since the class of all finite 
unions of intervals is dense, the proof of the theorem is com- 


plete. J 


Theorem C. Every separable, non atomic, normalized meas- 
ure algebra (S,u) 1s isomorphic to the measure algebra (T,v) of 
the unit interval. 


Proof. Let {EZ,} be a dense sequence in the metric space 
S(u) of (Su). For each m = 1,2, ---, the set of elements of the 
form (}i.1 4;, where, for each i = 1, ---, m, 4; is either E; or 
X — E;, is a partition P, of X. It is clear that the sequence 
{P,,} of partitions is decreasing; the density of {Z,} implies that 
the sequence {P,} of partitions is dense. It follows from Theorem 
A that lim, | P,| = 0. 

To each element E of the partition P; we may make correspond 
a subinterval T(E) of Y so that w(E) = »(T(E)) and so that 
these intervals constitute a partition of Y. Separately within 
each of these intervals we imitate similarly the behavior of Py, 
and we proceed so on by induction. We obtain in this manner a 
sequence {Q,} of partitions of Y into intervals; the fact that the 
transformation TJ, from partition elements of {P,} into intervals, 
is measure preserving implies that lim, | Q, | = 0, and therefore, 
by Theorem B, that {Q,} is dense. 

If we define T not only for partition elements occurring in 
{P,} but also for finite unions of such elements by assigning to 
each such finite union the corresponding finite union of partition 
elements of {Q,}, then the transformation T is an isometry from 
a dense subset of the metric space S(u) onto a dense subset of 
3(v). It follows that there is a unique isometric transformation 
T from $(u) onto 3(v) which coincides with T everywhere that 
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the latter is defined. Since T preserves unions and differences, 
and since these operations are uniformly continuous functions of 
their arguments, it follows that Tis an isomorphism. J 


(1) If (S,u) is a o-finite, non atomic measure ring and if Eye S, Ey ¥ 0, 
then, for every positive number e, there exists an element E of S such that 
EC Ey and 0 <p(E) <e. (Hint: if u(Eo) < 0 and if EZ; is an element of S 
such that E; C Ey and 0 < w(E;) < w(Eo), then either u(E1) S $u(Eo) or 
u(Eo — Ex) S 3u(Eo).) 

(2) If (S,u) is a o-finite, non atomic measure ring, and if Ey eS, then, for 
every extended real number a with 0 S @ S u(Ep), there exists an element E 
in S such that E C Ep and p(E) = a. (Hint: since the case a = © is trivial, 
there is no loss of generality in assuming that u(Zo) <0. The desired result 
follows by a transfinite exhaustion process. The method is similar to the one 
used in proving that any two points in a complete, convex metric space may be 
joined by a segment, and in fact the present assertion is a special case of this 
general theorem in metric geometry; (cf. 40.2 and 40.8).) 

(3) If (S,u) is a totally o—finite, non atomic measure algebra, and if EoeS, 
then, for every extended real number @ with u(E) S a S u(X), there exists an 
element Z in S such that Eo C E and p(Z) =a. (Hint: if @ is finite, apply (2) 
to X — Eo and p(X) — a.) 

(4) If (S,z) is a totally finite measure algebra, then the set of all values of yu 
is a closed set. 

(5) If a o—finite, non atomic measure ring (S,u) contains at least one element 
different from 0, then its metric space S8(#) has no isolated points. Is it true that, 
conversely, if S(4) has no isolated points, then (S,u) is non atomic? 

(6) Every separable, non atomic, totally o-finite measure algebra (S,u) such 
that p(X) = 0, is isomorphic to the measure algebra (T,v) of the real line. 
(Hint: it follows from (2) that there exists a sequence {Xn} of elements in S 
such that X = U%_1 Xn and w(X,) = 1, 2 = 1, 2, ---, and hence such that 
Theorem C is applicable, for each m, to the algebra of subelements of Xp.) 

(7) Every measure algebra is isomorphic to the measure algebra of a measure 
space; (cf. 40.15c). 


§ 42. FUNCTION SPACES 


There are certain metric spaces associated with an arbitrary 
measure space (X,S,u) which are similar to the space S(u) of 
measurable sets of finite measure. The one lying nearest at hand 
is the class £, (or £,(u)) of all (extended real valued) integrable 
functions. If for fin £,; we write 


fll = flr le 


and for f and g in £; we write p(/,g) = || f — g ll, (cf. § 23), then 
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the function p has all but one of the usual properties of a metric. 
The missing property is, of course, the positiveness of p; if p(/,g) 
= 0, it does not necessarily follow that f = g. We know what 
does follow: by 25.B, p(/,g) = 9 is equivalent to f = g[u]. We 
adopt again the same attitude as in the case of the space of 
measurable sets of finite measure. Two elements (i.e. functions) 
in £, are to be regarded as equal if their distance is zero, or, 
equivalently, if they are equal almost everywhere [y]; with this 
understanding £, becomes a metric space which (cf. 26.B) ig 
even known to be complete. 

For some purposes in analysis it is desirable to generalize these 
considerations. If p is a real number, p > 1, we shall denote by 
Lp (or Lp(u)) the class of all those measurable functions / for 
which | f|? is integrable. In analogy with the situation in £, 
we shall identify two elements of £, if they are equal almost 
everywhere [yu]; up to a certain point the theory of £, imitates 
that of £; very closely. We define, for instance, for f in £5, 


isis = (flere) 


and we write, if f and g are in Lp, pp(fig) = ||f —g llp. At this 
stage we run into difficulties. While it is clear that pp(f,g) = 
pp(g,f) 2 0, and while it is equally clear that p,(f,g) = 0 if and 
only if f = g [uJ, itis not clear that the triangle inequality is valid, 
nor yet, and this is much more serious, that p, is always finite. 
In order to settle these difficulties we proceed now to present 
proofs of two classical results; the following one is known as 
Hélder’s inequality. 


Theorem A. If p and q are real numbers greater than 1 
such that - pan = 1, and if fe &, and ge &q, then fge&y 
and \| fg || S Wf llll le 
Proof. We consider an auxiliary function @ defined for all 

Positive real numbers ¢ by ¢(¢) = - + . . Differentiating we 


obtain 
eQ=Ptao re, 
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so that 1 is (in the domain under consideration) the only critical 
value of ¢. Since 


lim, 0 o(t) ia lim, « (4) =, 


it follows that the value of ¢ at 1 is a minimum, and therefore 
that 
| ae ia ae | 
—~+—=¢2 460) =—-+-=1. 
P 4 P 4 
If @ and 4 are any two positive numbers, and if we write ¢ = 
a'!4/$1!?, it follows that 
qe ga a? bf 
+ or abS-+—, 
bp aq Pp 4 
and it is clear that the latter inequality remains valid even if a 
and # are allowed to be 0. 
We turn now to the proof of the theorem. If either || f ||, = 0 
or || g ||, = 0, then the result is trivial; in all other cases we may 


write | | | 
pe 38 2 BL 
oa ee il 


Applying the last written inequality we obtain 
el Sho 9 ge 


eM Ne PT pa ” fig ledu 


Since fg is measurable, this inequality shows already that fg © £1; 
by integrating it we obtain the desired result. 
Our next result is known as Minkowski’s inequality. 


Theorem B. If p is a real number greater than 1, and if f 
and g are in &py, then f + ge Sp and 


If+ elle SIF lle + ll g ll 


Proof. H6lder’s inequality for a measure space containing 
two points, each of measure 1, yields the elementary inequality 


| ab, + Azbe | s (| Qa, | + | a2 [pytin(] i le + | be eye, 


ls 


[Sec. 42] TRANSFORMATIONS AND FUNCTIONS 177 


ee | 
where, as before, r +-=1. It follows that 
q 


lftePsifl-lf+ePtt+iel ste s 
= (fl +] gl)? Ql st gle), 
and hence that 
p+? s 271 F)? + | 2/7). 


This implies that f + g © £,; the desired inequality follows from 
the relations 


(lf + |p)? = fist glrau < 


s fislist gtd +figllf+elms 


s(fisan) "(fis+elen) 
+ (flevar) (fist ear)” = 


a (iF llp h ll g Inf +g lpr. € 


Since it follows from Theorem B that if f, g, and # are in Lp, 
then 


pr f,g) = If “Re: llp s If =A llp oF | hi & Np = Ppl fA) ate pp(A,g); 


we see that £, is indeed a metric space; the proof that serves to 
establish the completeness of £; carries over with only trivial 
changes and establishes the completeness of Lp. 


(1) The metric space £,(#) on a measure space (X,S,u) is separable if and 
only if the space 8(u) of measurable sets of finite measure is separable. (Hint: 
if a class of sets is dense in S(u), then the set of all finite linear combinations with 
rational coefficients of the characteristic functions of these sets is dense in £p(1).) 

(2) Another occasionally useful space is the set 90 of all essentially bounded 
measurable functions. If we write, for any / in I, 


Il flo = ess. sup. {| f(x) |: xe X} 
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and, for f and g in M, pa(fig) = || f — gle, then om (with our by now familiar 
conventions as to what constitutes equality for two elements of a measure 
theoretically defined function space) becomes a complete metric space. 

(3) The space £2 is deservedly the most extensively studied of the function 
spaces we described; it is in a legitimate sense the most direct and fruitful 
generalization of ordinary, finite dimensional, Euclidean space. A linear 
functional on £2 is a real valued function A on £2 such that 


A(af + Bg) = aA(f) + BA(g) 


whenever a and 8B are real numbers and f and gare in £2. A linear functional A 
is bounded if there exists a positive, finite constant ¢ such that | A(/)| 
cll f lle for every f in 2. It is an elementary geometric property of £2 (whose 
proof depends on nothing deeper than that £2 is complete) that corresponding 
to every bounded linear functional A there exists an element g of £2 such that 
A(f) = ffedu for every f in £2. This fact may be used to prove the Radon- 
Nikodym theorem (of which, incidentally, it is in turn a reasonably easy conse- 
quence). For the sake of simplicity we shall restrict our outline of this proof 
to the case of finite measures. Suppose then that u and p are two finite measures 
such that y<< yp and writeAX =u +p. 


(3a) If, for every f in £20), ACY) = fre, then A is a bounded linear func- 
tional on £2(A). 
(3b) If ACS) = freer, then OS gS1{A]. (Hint: if f is the characteristic 


function of a measurable set E, then A(/) = v(E) S A(E).) 
(3c) If E = {x: g(x) = 1}, then A(EZ) = 0. (Hint: A(E) = v(Z).) 


(3d) fra — g)dv = ffedu for every non negative measurable function /. 


(3e) If go = = , then, for every measurable set E, v(E) = J, godu. (Hint: 


writes = -) 
(4) Suppose that (X,S,u) is a finite measure space and write, for any two real 
valued measurable functions f and g, 


polis) = f 15h, du. 


The function po is a metric; convergence with respect to po is equivalent to con- 
vergence in measure. 


§43. sET FUNCTIONS AND POINT FUNCTIONS 


In this section we shall study the connection between certain 
functions of a real variable and finite measures on the real line. 
Throughout this section we shall assume that 
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X is the real line, S is the class of all Borel sets, and yp is 
Lebesgue measure on S. 


We shall consider monotone, non decreasing functions f on X, 
i.e. functions f for which f(x) S f(y) whenever x < y; for brevity 
of expression we shall simply call such functions monotone. If / 
is a bounded monotone function, then it is easy to see that 


lim,.,-af(~) and lim,.,+.f(*) 
always exist and are finite; it is customary to denote these limits 
by f(~—) and (+) respectively. 
Theorem A. [fv is a finite measure on S and if, for every 
real number x, 
File) = v({t: —0 <t < x}), 
then f, is a bounded monotone function, continuous on the left 
and such that f,(—») = 0. 


Proof. The boundedness and monotoneness of f/, follow from 
the corresponding properties of ». Since f,(—z) = »((—%, —n)), 
n= 1,2, ---, it follows that 


SA(—-%) = lim, f(—7) = (NM n=o i aire a —n}) = 
= »(0) = 0. 
To prove that f, is continuous on the left at each x, suppose that 


{x,} is an increasing sequence of numbers such that lim, x, = x; 
we have 


= (Anas [xn,x)) = lim, v([xn,*)) = lima 0%) — fren). Ul 
The following result goes in the converse direction. 


Theorem B. If f is a bounded monotone function, continuous 
on the left and such that f(—«) = 0, then there exists a unique 
jinite measure v on S such that f = f,. 


Proof. In all details this proof parallels the construction of 
Lebesgue measure. If, in other words, we define » for every 
semiclosed interval by »([a,4)) = f(4) — f(a), then the results of 
§ 8 are valid for v in place of » and hence the extension theorem 
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13.A may be applied. The only argument which needs modifica- 
tion is the one used to establish 8.C. We are to prove that if 
[40,40) is a semiclosed interval contained in the union of a sequence 
{[a:,;)} of semiclosed intervals, then 


v([20540)) S Dims v(lai,%)). 


If ay = do, the result is trivial; otherwise let ¢ be a positive number 
such that ¢ < 4) — a. Since f is continuous on the left at a;, 
to every positive number 6 and every positive integer i there 
corresponds a positive number e; such that 


fle) — fla — 0) <3 F=12 


If Fo = [4,40 ~ | and U; = (a; = €;,4:), i= 1, 2, PeES then 
Fy Cc Usz1 U,, and therefore, by the Heine—Borel theorem, there 
is a positive integer 7 such that 


Fo Cc Ui-a Ui. 
From the analog of 8.B for » we obtain 
f(4o — €) — f(a0) S Lier (f(Gi) — fla — €:)' = 
= Dies (£6) — f(as)) + Lie Ya) — fla: — «)) S 
= Dies G) — f(a) + 3. 
Since ¢ and 6 are arbitrary, the desired result follows from the 
fact that f is continuous on the left at 4. J 
Theorems A and B establish a one to one correspondence be- 
tween all finite measures vy on S and some functions f, of a real 
variable; the following two theorems show how certain measure 


theoretic properties of »y may be characterized in terms of the 
corresponding function /,. 


Theorem C. If v is a finite measure on S, then a necessary 
and sufficient condition that f, be continuous is that v({x}) =0 
for every point x. 


Proof. If {x,} is a decreasing sequence of numbers such that 
lim, %, = *, then 


v({x}) = v((\rar [xyn)) = lim, o([x,%n)) = lima (Aen) — f(x). 
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The proof is completed by the observation that /, is continuous 
at x if and only if the last term of this relation vanishes. J 

A real valued function f of a real variable is called absolutely 
continuous if to every positive number ¢ there corresponds a 
positive number 6 such that 


Dies [AG — f(a) | < 


for every finite, disjoint class {(a;,4;): 1 = 1, ---, n} of bounded 
open intervals for which >\j., (4; — a:) < 6. 
Theorem D. If v is a finite measure on S, then a necessary 


and sufficient condition that f, be absolutely continuous is that 
v be absolutely continuous with respect to wu. 


Proof. If» «<4, then to every positive number e there corre- 
sponds a positive number 6 such that »(Z) < e for every Borel 
set E for which u(E) < 6. Hence if {(a;,b;): i = 1,%---, n} is a 
finite, disjoint class of bounded open intervals for which 


u(Uien [4;,0 i)) = Din (2 — aj) < 5, 


Y7. | AG) — f(a) | = S21 (a0) = 
r(Uia [2,,b;)) <é 


Suppose, conversely, that f, is absolutely continuous. Let e 
be any positive number and let 6 be a positive number such that 
Die: (4: — a) < Simplies 7.1 | f.() —fila)|<e If Eisa 
Borel set of Lebesgue measure zero, then there exists a disjoint 
sequence {[2;,4,)} of semiclosed intervals such that 


Ec Ui [aid and S071 (4; — 4) < 6. 


Since it follows that 07.1 | /,(4:) — f,(a,) : < ¢ for every positive 
integer 7, we have 


WE) S Diu (4,4) = Der | A) —f,(a) | S 


Since ¢ is arbitrary, we must have (EZ) = 0. Jf 

For the statement of the next result (which is an easy but 
frequently useful consequence of the Lebesgue decomposition 
theorem) we need one more definition. We shall say that a finite 


then 
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measure v on § is purely atomic if there exists a countable set C 
such that p(X — C) = 0. 


Theorem E. [f v is a finite measure on S, then there exist 
three uniquely determined measures v,, v2, and v3 on S whose 
sum is v and which are such that v, is absolutely continuous 
with respect to p, v2 is purely atomic, and vz is singular with 
respect to u but v3({x}) = 0 for every point x. 


Proof. According to the Lebesgue decomposition theorem 
(32.C) there exist two measures vp and »; on S whose sum is » 
and which are such that v9 is singular and », is absolutely con- 
tinuous with respect to uw. Let C be the set of those points x 
for which vo({x}) # 0; the finiteness of » implies that C is count- 
able. If we write 


v(E) =» (ENC) and »3(F) = (FE —C), 


then it is clear that the decomposition » = » + v2 + v3 has all 
the desired properties. Uniqueness follows from the uniqueness 
of the Lebesgue decomposition and the easily verifiable unique- 


nessofC. Jj 


(1) All the results of this section remain true for signed measures p if the condi- 
tion that f, be monotone is replaced by the condition that it be of bounded 
variation, (Hint: every function of bounded variation is the difference of two 
monotone functions.) 

(2) Several of the well known properties of monotone functions and absolutely 
continuous functions may be proved by using the methods of this section; we 
indicate two examples. 

(2a) A monotone function has (at most) countably many discontinuities. 
(Hint: for bounded monotone functions f which are continuous on the left and 
such that /(—%) = 0, apply Theorem B and the reasoning in the proof of 
Theorem C. The general case can be reduced to this special case by some obvious 
transformations.) 

(2b) If a bounded monotone function / is absolutely continuous and such 
that f(—%) = 0, then there exists a non negative Lebesgue integrable function 


@ such that f(x) = [7 oat). (Hint: apply Theorems B and D.) 


(3) The purpose of the following considerations is to show that the results of 
15.C and 15.1 can be extended to a very wide class of measures including the 
ones discussed in this section. 

(3a) If two finite measures wu and v on a o-ring S of subsets of X agree on a 
lattice L of sets in S, then w and v agree on the o-ring S(L) generated by L. 
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(Hint: if Ee L, FeL, and E C F, thenu(F — E) = o(F — E). Apply 5.2, 8.5, 
and 13.A.) 

(3b) If two finite measures uz and v are defined on the class of Borel subsets 
of a metric space X and agree on the class U of open subsets of X, then they agree 
for all Borel sets. 

(3c) If w is a finite measure defined on the class of Borel subsets of a metric 
space X, and U is the class of open subsets of X, then w(E) = inf {u(U): E 
Cc Ue} for every Borel set E. (Hint: the set function y* defined by y*(E) = 
inf {u(U): EC Ue} is a finite metric outer measure which defines a measure 
pv on the class of Borel sets, and v agrees with » on U.) 

(3d) If u is a measure on the class of Borel subsets of a metric space X, 
and C is the class of closed subsets of X that have finite measure, then u(E) = 
sup {u(C): ED CeC} for every Borel set E of o—finite measure. (Hint: it is 
sufficient to consider sets E of finite measure. Write »(F) = w(E NM F) and 
apply (3c) toy and X — E.) 

(3e) If 4 is a measure on the class of Borel subsets of a separable, complete, 
metric space X, and Cp is the class of compact subsets of X that have finite 
measure, then u(E) = sup {u(C): ED Ce Co} for every Borel set E of o—finite 
measure. (Hint: apply (3d) and 9.10.) 

(4) Ify is a finite measure on S and if a Borel set Ep is an atom of », then there 
exists a point x9 in Eo such that »(Ey — {xo}) = 0. (Hint: by means of (3) the 
general case may be reduced to the case in which Ep is closed and bounded.) 

(5) Ify is a finite measure on S, then a necessary and sufficient condition that 
J, be continuous is that » be non atomic. 

(6) Most of the results of this section remain true for measures and signed 
measures v which are not necessarily finite; what is essential is that »(E) be 
finite whenever E is a bounded interval. 

(7) In connection with (6) and for the purpose of constructing counter exam- 
ples, it is interesting to observe that there exist o—finite measures vy on S which 
are absolutely continuous with respect to », but for which »(Z) = © for every 
interval E with a non empty interior. (Hint: let f be a positive, Lebesgue 


. . € ele 
integrable function such that f f*du = © for every positive number ¢; for 
—e€e 


example write f(x) = (e!7!4/ [«|)— If {r1, re, ---} is an enumeration of the 
set of all rational numbers, if, for every x, 


as) = Daeagglle — 1, 


and if, for every Borel set E, »(E) = f g’du, then v has all the desired properties, 
EB 


Observe that since f gd =>. a Stew, the function gis finite valued a.e. [y].) 


Chapter IX 


PROBABILITY 


§ 44. HEURISTIC INTRODUCTION 


The purpose of this section is to give an intuitive justification 
for the measure theoretic treatment of probability. 

The principal undefined term in the theory of probability is 
“event.” Intuitively speaking, an event is one of the possible 
outcomes of some physical experiment. To take a rather popular 
example, consider the experiment of rolling an ordinary six-sided 
die and observing the number x (= 1, 2, 3, 4, 5, or 6) showing 
on the top face of the die. “The number x is even” —“‘it is less 
than 4”—“‘it is equal to 6”—each such statement corresponds to 
a possible outcome of the experiment. From this point of view 
there are as many events associated with this particular experi- 
ment as there are combinations of the first six positive integers 
taken any number at atime. If, for the sake of aesthetic complete- 
ness and later convenience, we consider also the impossible event, 
“the number x is not equal to any of the first six positive integers,” 
then there are altogether 2° admissible events associated with the 
experiment of the rolling die. For the purpose of studying this 
example in more detail let us introduce some notation. We write 
{2,4,6} for the event “x is even,” {1,2,3} for “x is less than 4,” 
and so on. The impossible event and the certain event (= 
{1,2,3,4,5,6}) deserve special names; we reserve for them the 
symbols 0 and X respectively. 

Everyday language concerning events uses such phrases as 


these: “two events E and F are incom atible or mutually exclu- 
Pp y 
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sive,” “the event E is the opposite of the event F or complemen- 
tary to F,” “the event E consists of the simultaneous occurrence 
of F and G,” and “the event E consists of the occurrence of at 
least one of the two events F and G.”” Such phrases suggest that 
there are relations between events and ways of making new events 
out of old that should certainly be a part of their mathematical 
theory. 

The notion of complementary event is probably closest to the 
surface. If F is an event, we denote the complementary event by 
E’: an experiment, one of whose outcomes is F, will be said to 
result in E’ if and only if it dges not result in Z. Thus if E = 
{2,4,6}, then EZ’ = {1,3,5}. We may also introduce combina- 
tions of events suggested by the logical concepts of “and” and 
“or.” With any two events E and F we associate their “union” 
EU F and their “intersection” EM F; here E U F occurs if 
and only if at least one of the two events E and F occurs, while 
E 1 F occurs if and only if both EF and F occur. Thus if & = 
{2,4,6} and F = {1,2,3}, then E U F = {1,2,3,4,6} and EN F 
= {2}. 

The considerations of the preceding paragraphs, and their 
obvious generalizations to more complicated experiments, justify 
the conclusion that probability theory consists of the study of 
Boolean algebras of sets. An event is a set, and its opposite event 
is the complementary set; mutually exclusive events are disjoint 
sets, and an event consisting of the simultaneous occurrence of 
two other events is a set obtained by intersecting two other sets— 
it is clear how this glossary, translating physical terminology into 
set theoretic terminology, may be continued. 

For the traditional theory of probability, concerned with simple 
gambling games such as the rolling die, in which the total number 
of possible events is finite, the above heuristic reduction of the 
class of all pertinent events to a Boolean algebra of sets is ade- 
quate. For situations arising in modern theory and practice, 
and even for the more complicated gambling games, it is neces- 
sary to make an additional assumption. This assumption is that 
the system of events is closed under the formation of countably 
infinite unions, or, in the technical language we have already used, 
that the Boolean algebra is in fact a o—algebra. 
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Perhaps an example, though a somewhat artificial one, might 
illustrate the need for the added assumption. Suppose that a 
player determines to roll a die repeatedly until the first time that 
the number showing on top is 6. Let E, be the event that the 
first 6 appears only on the ath roll. The event E = Ur. En 
occurs if and only if the game ends in a finite number of rolls. 
The occurrence of the opposite event EF’ is at Jeast logically (even 
if not practically) conceivable, and it seems reasonable to want 
to include a discussion of it in a general theory of probability. 
Numerous examples of this kind, together with some rather deep 
lying technical reasons, justify the statement that the mathemati- 
cal theory of probability consists of the study of Boolean o-—alge- 
bras of sets. 

This is not to say that all Boolean o-algebras of sets are within 
the domain of probability theory. In general, statements con- 
cerning such algebras and the relations between their elements 
are merely qualitative; probability theory differs from the general 
theory in that it studies also the quantitative aspects of Boolean 
algebras. We proceed now to describe and motivate the introduc- 
tion of numerical probabilities. 

When we ask “‘what is the probability of a certain event?”, 
we expect the answer to be a number, a number associated with 
the event. In other words, probability is a numerically valued 
function » of events £, that is of sets of a Boolean o-algebra. On 
intuitive and practical grounds we demand that the number z(F) 
should give information about the occurrence habits of the event 
E. If, in a large number of repetitions of the experiment which 
may result in the event E, we observe that E actually occurs only 
a quarter of the time (the remaining three quarters of the experi- 
ments resulting therefore in E’), we may attempt to summarize 
this fact by saying that h(E) = 3. Even this very rough first 
approximation to what is desired yields some suggestive clues 
concerning the nature of the function p. 

If, to begin with, u(Z) is to represent the proportion of times 
that E is expected to occur, then u(Z) must be a non negative 
real number, in fact a number in the unit interval [0,1]. If E 
and F are mutually exclusive events—say E = {1} and F = 
{2,4,6} in the example of the die—then the proportion of times 
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that the union E U F (= {1,2,4,6} in the example) occurs is 
clearly the sum of the proportions associated with EF and F 
separately. If an ace shows up one sixth of the time and an even 
number half the time, then the proportion of times in which the 
top face is either an ace or an even number is 4 + 3. It follows 
therefore that the function » cannot be completely arbitrary; 
it is necessary to subject it to the condition of additivity, that is 
to require that if E N F = 0, then u(E U F) should be equal to 
u(E) + w(F). Since the certain event X occurs every time, we 
should also require that n(X) = 1. 

We are now separated from the final definition of probability 
theory only by a seemingly petty (but in fact very important) 
technicality. If u is an additive set function on a Boolean 
o-algebra of sets, and if {Z,} is an infinite disjoint sequence of 
sets in the algebra, then it may or may not be true that 
urea: En) = Dori w(E,). The general condition of countable 
additivity is a further restriction on p—a restriction without 
which modern probability theory could not function. It is a 
tenable point of view that our intuition demands infinite additivity 
just as much as finite additivity. At any rate, however, infinite 
additivity does not contradict our intuitive ideas, and the theory 
built on it is sufficiently far developed to assert that the assump- 
tion is justified by its success. To sum up: 


numerical probability is a measure » on a Boolean o—algebra 
S of subsets of a set X, such that p(X) = 1. 


In our development of measure theory in the preceding chapters, 
the concepts “measurable function,” “integral,” and ‘product 
space” played important roles; in the immediately following 
paragraphs we shall introduce the probability meaning of these 
concepts. 

We begin with the frequently used term “random variable.” 
“A random variable is a quantity whose values are determined 
by chance.” What does that mean? The word “quantity” 
suggests magnitude—numerical magnitude. Ever since rigor has 
come to be demanded in mathematical definitions, it has been 
recognized that the word “variable,” particularly a variable whose 
values are ‘“‘determined” somehow or other, means in precise 
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language a function. Accordingly a random variable is a func- 
tion: a function whose numerical values are determined by chance. 
This means, in other words, that a random variable is a function 
attached to an experiment—once the experiment has been per- 
formed the value of the function is known. We have seen that 
the analytic correspondent of an experiment is a measure space 
X; it follows that a function of outcomes is a function of the 
points x of X. A random variable is a real valued function on the 
measure space X. 

The preceding sentence does not yet fully describe the cus- 
tomary usage of “‘random variable.” A function f on the measure 
space X is called a random variable only if probability questions 
concerning the values of f can be answered. An example of such 
a question is “what is the probability that f lies between a and 
B?” In measure theoretic language: ‘“‘what is the measure of the 
set of those points x for which the inequality a < f(x) S B is 
satisfied?” In order for all such questions to be answerable, it 
is necessary and sufficient that the sets that occur in them belong 
to the basic o—algebra S of X; in other words a random variable 
is a measurable function. 

Let us consider in detail the random variable f associated with 
an honest die by the definition f(x) = x. The possible values of 
f are the first six positive integers. The arithmetic mean of these 
values, that is the number 4(1 + 2+3+4+5 + 6), is of con- 
siderable interest in probability theory; it is called the average, 
or mean value, or expectation of the random variable f. If the 
die is loaded, so that the probability p, associated with x is not 
necessarily %, then the arithmetic mean is replaced by a weighted 
average; in this case the expectation of f is 1-p, +---+ 6-De. 
The analogs of such weighted sums, in cases where the number of 
values of the function need not be finite, are given by integrals; 
if the measurable function is integrable, then its expectation 
is by definition the value of its integral. 

We see thus that measurable functions and their integrals have 
their probability interpretations; in order to find such an inter- 
pretation of product spaces, we continue to study the example of 
the die. For simplicity we make again the classical assumption 
that any two faces are equally likely to turn up and that conse- 
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quently the probability of any particular face showing is 2. 


Consider the events E = {2,4,6} and F = {1,2}. The first 
Notion we want to introduce, the notion of conditional probability, 
can be used to answer such questions as these: “what is the 
probability of E when F is known to have occurred?” In the 
case of the example: if we know that x is less than 3, what can 
we say about the probability that x is even? The adjective 
“conditional” is clearly called for in the answer to a question of 
this type: we are evaluating probabilities subject to certain 
preassigned conditions. 

To get a clue to the answer, consider first the event G = {2} 
and ask for the conditional probability of E, given that G has 
already occurred. The intuitive answer is perfectly clear in this 
case, and is independent as it happens of any such numerical 
assumptions as the equal likelihood of the faces. If x is known 
to be 2, then x is certainly even, and the probability must be 1. 
What made the answer easy was the fact that G was contained in 
E. The general question of conditional probability asks us to 
evaluate the extent (measured by a numerical probability or 
proportion) to which the given event F is contained in the un- 
known event E. Phrased in this way, the question almost sug- 
gests its own answer: the extent to which F is contained in E 
can be measured by the extent to which E and F are likely to 
occur simultaneously, that is by n(E NF). Almost—not quite. 
The trouble is that u«(Z N F) may be very small for two reasons: 
one is that not much of F is contained in E, and the other is that 
there is not very much of E altogether. In other words it is not 
merely the absolute size of E N F that matters: it is the relation 
or proportion of this size to the size of F that is relevant. 

We are led therefore to define the conditional probability of 
E, given that F has already occurred, in symbols ur(£), as the 
ratio n(E N F)/u(F). For E = {2,4,6} and G = {2}, this gives 
the answer we derived above, ue(E) = 1; for E = {2,4,6} and 
F = {1,2}, we get the rather reasonable figure ur(E) = 3. In 
other words if it is known that x is either 1 or 2, then x is odd or 
even (i.e. equal to 1 or equal to 2) each with probability 4. 

Consider now the following two questions: “F happened, what 
is the chance of E?” and simply “what is the chance of E?” 
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The answers of course are ur(E) and u(£) respectively. It might 
happen, and it does happen in the example given above, that 
the two answers are the same, that, in other words, knowledge of 
F contributes nothing to our knowledge of the probability of EF. 
It seems natural in this situation to use the word “independent”: 
the probability distribution of E is independent of the knowledge 
of F. This motivates the precise definition: two events E and F 
are independent if ur(E) = u(Z). The definition is transformed 
into its more usual form and at the same time gains in symmetry 
if we recall the definition of up(Z). In symmetric form, E and F 
are independent in the sense of probability (statistically or 
stochastically independent) if and only if u(E NM F) = u(E)p(F). 

Suppose now that we wish to make two independent trials of 
the same experiment—say, for example, to roll an honest die 
twice in succession. In a compound experiment such as this one, 
we do not expect the reported outcome of the experiment to be a 
number, but rather a pair of numbers (%,x2). The points of the 
measure space associated with the compound experiment are, 
in other words, the points of the Cartesian product of the original 
measure space with itself; the problem is to determine how the 
probability is distributed among these points. For a clue to the 
answer, consider the events E = “x, <3” and F = “‘xg < 4.” 
We have n(£) = 4and w(F) = 4;if we interpret the independence 
of trials to mean the independence of any two events such as E and 
F, we should have p(E N F) = 2. 

On the basis of the preceding paragraph we shall say that, if 
the analytic description of an experiment is given by a measure 
space (X,S,u), then the analytic description of the experiment 
consisting of two independent trials of the given one is the 
Cartesian product of (X,S,u) with itself. 

What we can do once we can do again. Just as two repetitions 
of an experiment give rise to two dimensional Cartesian products, 
similarly any finite number of repetitions (say m) give rise to 
n-dimensional Cartesian products. The procedure can be ex- 
tended also to infinity: the analytic model of an infinite sequence 
of independent repetitions of an experiment is an infinite dimen- 
sional Cartesian product space. Even if an actually infinite 
sequence of repetitions of an experiment is practically unthinkable, 
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there is a point in considering infinite dimensional product spaces. 
The point is that many probability statements are assertions con- 
cerning what happens in the long run—assertions which can be 
made precise only by carefully formulated theorems concerning 
limits. Hence even if practice yields only approximations to 
infinity, it is the infinite sequence space that is the touchstone 
whereby the mathematical theory of probability can be tested 
against our intuitive ideas. 

We leave now these heuristic considerations and, in the next 
section, turn to the detailed investigation of the basic concepts 
and results of probability theory. 


§ 45. INDEPENDENCE 


A probability space is a totally finite measure space (X,S,u) 
for which p(X) = 1; the measure yw on a probability space is 
called a probability measure. 

If E is a finite or infinite class of measurable sets in a probability 
space (X,S,u), the sets of the class E are (stochastically) inde- 


pendent if 
u((Vier Ei) = T1i-1 w(Ed) 


for every finite class {E,;: 1 = 1, ---, a} of distinct sets in E. 
In case the class E contains only two sets E and F, the condition 
of independence is expressed by the equation 


u(E NF) = w(E)n(F). 


An illuminating example of two independent sets E and F is 
obtained by taking for X the unit square with Lebesgue measure, 
X={(xy): OS «$1,085 y S 1}, and writing E = {(x,y): 
OSxSl,asySb}andF= {(«,y):cSxSd08Sy ¥€ 1}, 
where a, 4, c, and d are arbitrary numbers in the closed unit 
interval. We remark that it is zot sufficient for the independence 
of the sets of a class E (even if E is a finite class) that any two 
distinct sets of E be independent. 

If & is a finite or infinite set of real valued measurable functions 
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on a probability space (X,S,u), the functions of the set & are 
(stochastically) independent if 


u(( Vint {xz fi(x) e M;}) = IIi-1 a({x: f(x) e M;}) 


for every finite subset {/;:7 = 1, ---, 2} of distinct functions in 
& and every finite class {M;: i = 1, ---, m} of Borel sets on the 
real line. An equivalent way of expressing this condition is to 
say that if, for each f in &, M; is a Borel set on the real line, then, 
for every possible choice of the Borel sets M;, the sets of the class 
E = {f'(M,):f © &} are independent. An illuminating example 
of two independent functions f and g is obtained by taking for 
X the unit square, as in the preceding paragraph, and defining 
f and g by f(x,y) = ~ and g(x,y) = y. 

As our examples of independent sets and functions might 
indicate, there is a very close connection between the concepts 
of stochastic independence and Cartesian product. Suppose, in 
fact, that /; and f are two independent functions on a probability 
space (X,S,u) and consider the transformation T from X into the 
Euclidean plane, defined by 


T(x) = il), A). 


If measurability in the plane is interpreted in the sense of Borel, 
then the facts that X is a measurable set and f; and fp are measur- 
able functions imply that T is a measurable transformation; simi- 
larly, of course, the functions /, and /2 are themselves measurable 
transformations from X into the real line. A direct comparison 
with the definition of independence shows that the fact that f 
and fo are independent can be very simply expressed by the 
equation 
eT = wh X ufo. 


(If the transformation T is denoted, as it may well be, by the 
symbol f, X fo, then the last written equation takes the form of 
an easily remembered distributive law.) If the functions g, and 
go on the plane are defined by 


&i(Yy2) = 1 and gel yi,¥2) = Yes 
then it is easy to verify that f, = g,T and fp = g2T. From these 
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very simple considerations we may already draw a non trivial 
conclusion. 


Theorem A. Jf f, and fy are independent functions, neither 
of which vanishes a.e., then a necessary and sufficient condition 
that both f, and fz, be integrable is that their product f, fz be 
integrable; if this condition is satisfied, then 


fifads = ffidw- [fady. 


Proof. Using the notation established above, we see (by 39.C) 
that the integrability of | f;| is equivalent to the integrability of 
gi, i = 1, 2, and, by Fubini’s theorem, the integrability of | g; | 
and | go | implies that of | g:g2|. Conversely, of course, if | gg | 
is integrable, then almost every section of | gige| is integrable. 
Since each such section is a constant multiple of [ g, | or of | go | 
and since the assumption that /, and f2 do not vanish a.e. implies 
that these constant factors may be selected to be different from 
zero, it follows that the integrability of | gig2| implies that of 
|g, | and | g.|. Since, finally, another application of 39.C shows 
that | gigo| is integrable if and only if | /,f2| is integrable, the 
assertion concerning integrability is proved. The multiplicative 
relation follows from Fubini’s theorem. Jf 

The use of product spaces in the study of independent functions 
extends far beyond the simple case indicated by the reasoning 
above. Suppose, for instance, that {/,} is a sequence of inde- 
pendent functions and let Y be the Cartesian product of a sequence 
of real lines in each of which measurability is interpreted in the 
sense of Borel. If, for every x, 


T(x) = (A(x), fo), _ Ds 


then T is a measurable transformation from X into Y; a neces- 
sary and sufficient condition that the functions /, be independent 
is that wT! = wf! X wf X---. If the functions g, on Y 
are defined to be the coordinate functions, ga(yi, Ye) °°*) = Ins 
n=1,2,---, then f/f, = gaT, m = 1,2, ---. Similar results are 
true of course for arbitrary (finite, countable, or uncountable) 
sets of functions. 
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Theorem B. If {fij:i = 1, -+-, ky jf = 1, +--+, mi} ts @ set 
of independent functions, if o; is a real valued, Borel measurable 
function of n; real variables,i = 1, ---, k, and if 


Fix) = bil fa(*), ca *sFins(*)), 
then the functions fi, +++) fx are independent. 


Proof. The theorem is an easy application of the relations 
established above between product spaces and independence. 
Suppose that Y;; is the real line, 7 = 1, ---, 4,7 = 1, ---, 2, and 
Y = Xi; Viz If we write 


T(x) _ (fir(x), = “5 Sial)s ee *, fur (x), as “Stag ®))s 


Sii( Iu 8s Yams o> kts 89 Vim) = Nii 
and 


&i = b:i( Zits +, Sins)» 

then f; = g:T, i = 1, ---, k. Since the independence of the g,’s 
is obvious, the independence of the f;’s follows. JJ 

We conclude this section by introducing a frequently used 
notation of probability theory. If / is a real valued measurable 
function on a probability space (X,S,u), such that /? is integrable, 
then it follows from Schwarz’s inequality (i.e. Holder’s inequality 
with p = 2, cf. 42.A) that / itself is integrable and that, in fact, 


2 
(rau) < [7?du. 
If f du = a, then the variance of f, denoted by o?(/), is defined 


by o7(/) =f — a)*du. Since the integral of a constant func- 


tion over a probability space is equal to the value of the constant, 
it follows from the definition of a, by multiplying out the last 
written integrand, that 


an = (frau) - ( fre); 


it is clear that, for any real constant c, o?(cf) = c?o*(/). 
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Theorem C. Jf f and g are independent functions with a 
Jinite variance, then 


o(f +g) = o(f) + 0°(g). 
Proof. We have 


e+e) =f + ode - (futon) = 
= frau + 2ftedn + fetdu — (fran) 
as 2( frau) (sau) - (ecu) 


the desired result follows from Theorem A. §f 


(1) If Fis a measurable set of positive measure in a probability space (X,S,x), 
and if, for every measurable set E, ur(E) = u(F NM E)/u(F), then ur is a proba- 
bility measure on S such that ur(F) = 1; the sets E and F are independent if 
and only ifur(E) = w(Z). The number ur (Z) is called the conditional probability 
of E given F. 

(2) If (Ej: ¢ = 1, ---, n} isa finite class of measurable sets of positive measure, 


then 
ME; a) eee nN E,) _ w(E)) ue, (Ee) oo PEine-n En-1(En)- 


This result is known as the multiplication theorem for conditional probabilities. 
(3) If {Ei: @ = 1, ---, n} is a finite, disjoint class of measurable sets of posi- 

tive measure whose union is X (i.e. if {£;} is a partition of X), then, for every 

measurable set F, u(F) = > 9.1 u(E:)ue(F), and, if F has positive measure, 


mr (E;) = w(E;)ue(F)/>t-1 w(Es)ue,(F). 


This result is known as Bayes’ theorem. 

(4) Two partitions of X, say {E;: 7 = 1,---, n} and {Fp j = 1,---, m}, are 
called independent if u(E; N Fj) = w(E)u(F;) for i = 1, +++, 2 ‘anti j=l, 
+++, m. Two sets E and F are independent if and only if the partitions { E,E’ i 
and {F,F’} are independent. 

(5) Let X = {x: OS * S 1} be the unit interval with Lebesgue measure. 
For every positive integer 7 define a function a on X by ee Sn(x) = +1 


aa 
S*x< i is odd or even. 


The functions f, are called the Rademacher eet Any two of the functions 
Si, fe, and fife are independent, but the three together are not. 
(6) If f and g are independent integrable functions, if Mf is a Borel set on the 


real line, and if E = f-'(M), then Sedu =f fas: { gdu. (Hint: observe that 


or —1 according as the integer i for which + 
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x(x) = xu(/(x)) and apply Theorem B to show that the product of f and xu(/) 
is independent of g.) 
(7) Iff and g are measurable functions with finite variances such that o(/)e (g) 


~ 0, their coefficient of correlation is defined by 
Jfedu = Stew f gdp. 
ris) = o(f)o(g) ° 


where o(f) = V/o%( J) is the standard deviation of f. The functions / and g are 
uncorrelated if r(/,z) = 0. Iff and g are independent, then they are uncor- 
related. A necessary and sufficient condition that o?(f + g) = o?(f) + 07(g) 
is that f and g be uncorrelated. 

(8) Is it true that if f and g are uncorrelated, then they are independent? 
(Hint: let X be the unit interval and write f(x) = sin 2mx, g(x) = cos 21x.) 

(9) If f and g are independent integrable functions such that (f + g)? is 
integrable, then /? and g? are integrable. 


§ 46. SERIES OF INDEPENDENT FUNCTIONS 


Throughout this section we shall work with a fixed probability 
space (X,S,u). Our first result is known as Kolmogoroff’s in- 
equality. 


Theorem A. /f f;,7 = 1, ---, 2 are independent functions 
such that { fad =O, and [f2dy oe a Te 


A(x) = Utes | SEA) | Ge. f ts the maximum of the abso- 


lute values of the partial sums of the fs), then, for every positive 
number e, 


1 
w({x:|f() | 2 ) S dhe (fi). 
Proof. We write 
E={«lf@®|2e, «6 = Die 
Ey = {x:| se(x) | 2 ef 1 Aisice {uz | se(x) | < ef. 
We have 


and 


f Sn°dp. = [ s2du + p(E;) DS acies fidu = 
ke 


Ex 


2 f Se’dp = w(E,)e*. 
Et 
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Since E = Uti, Ex and since the sets Ey are disjoint, it follows 
that 


Sites fe) = fi tt fade = f sata = 
= >t, if Sedu = Vir wWE)e = We | 


Theorem B. Jf {f,} is a sequence of independent functions 
such that f frdu =0 and S221 07(fn) < ©, then the series 


Deni fa(x) converges a.e. 
Proof. If we write 
sax) = SL A(x), 2 = 1,2, °°, 
@m(x) = sup {| Sm4e(*) — Sm(x) [2k = 1,2, ---}, 
a(x) = inf {a,(x): m = 1,2, ---}, 


then a necessary and sufficient condition for the convergence of 
Drei fn(x) at x is that a(x) = 0. By Kolmogoroff’s inequality 
we have, for every positive number ¢ and every pair of positive 
integers m and n, 


u({x: Ui-1 | Sm4k(x) a Sm(x) | Pa el) Ss 3 Din o*(fi)s 


and therefore 


a({x: an(x) 2 e}) S ; 


lie m+1 a” (fx). 


It follows that 
1 
u({x: a(x) 2 el) S g dita o*( fi) 


and therefore, using the convergence of )o7.; 07(fn), that 
u({x: a(x) 2 e}) = 0. The desired result is implied by the 
arbitrariness of e. 
The next result goes in the converse direction. 
Theorem C. Jf {fn} is a sequence of independent functions 


and ¢ is a positive constant such that f frdu = Oand|fa(x)| Se 
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a.e.,n =1,2,--++, and if Drei fn(x) converges on a set of posi- 
tive measure, then 


en a7 (fn) < o. 


Proof. If so(x) =0 and sa(x) = Diehi(x), m= 1, 2, +, 
then Egoroff’s theorem implies (cf. 21.2) that there exists a posi- 
tive number d such that the set 


E = (\reo {x: | sn(x) | S a} 
has positive measure. If we write 
En = (Veo {x: | s(x) | Sd}, 2 =0,1,2, +9, 
then {£,,} is a decreasing sequence of sets whose intersection is E. 
If F, = En, — En, nm =1, 2, ---, and a, = [i sds, n=0, 
1, 2, ---, then = 


Qn — An} =f, Sid -f Snap -f Snap == 
En-1 Fa En-1 


=, 72742 f Ve ae f Sadie Speeds 5%, 
En-1 EZ, Fa 


m1 


Since 


Sd = w(E,_-1)07(fn) and FS nSn—14 oa 0, 


Ent En—1 


and since p(E,-1) 2 w(E) and | sa(v)| Sc+d for x in F,, 
n = 1,2, ---, it follows that 


Qn — On = u(E)o* (fa) — (C+ d)*u(Fa), 1 = 1,2, ---. 
Summing over ” from 1 to & we obtain 
a? > wlEy)d? = oy = w(E) Dia Pn) — (+4). | 


We remark that Theorems B and C imply that if {f,} is a 
uniformly bounded sequence of independent functions such that 


f frdp = 0 for n = 1, 2, «++, then the series 07.1 fn(x) either 


converges a.e. or diverges a.e.; the measure of the convergence 
set 1s always one of the extreme values, 0 or 1. 
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Theorem D. Jf {f,} is @ sequence of independent functions 
and ¢ is a positive constant such that |f,(x)| Sc a.e..n = 1, 
2, +++, then Dorerfn(x) converges a.e. if and only if both the 


series Dy2o4 fad and 5°01 07(fn) are convergent. 


Proof. The “if” follows by applying Theorem B to the se- 
quence {g,} defined by ga(x) = f(x) — f Fndp, n =1, 2, ---. 
To prove the converse, we consider the Cartesian product of the 
space X with itself and on it the functions 4, defined by /,(x,y) = 


a(x) — fa(y), 2 = 1,2, ---. Since the convergence a.e. of 
Dre1/n(x) implies that [2-1 An (x,y) converges a.e., and since 


Phadlu Xu) = 0, 


it follows from Theorem C that (721 07(4n) < ©. Since, how- 
ever, o7(hn) = 207(f,), we see that Sor. 07(fn) <0. Since 
o*(gn) = (fn), it follows from Theorem B that So72; gn(«) 
converges a.e. and therefore the relation 


fad = fa(x) a gnlx), a= I; 2 aon 


implies the convergence of >>°.1 f an | 


All our preceding results on series are included in the following 
very general assertion, known as Kolmogoroff’s three series 
theorem. 


Theorem E. If {f,} is a sequence of independent functions 
and c is a positive constant, and if E, = {x: |fnr(x)| Sc}, 
n=1,2, ---, then a necessary and sufficient condition for the 
convergence a.e. Of Dinnrfiu(x) is the convergence of all three 
series 


(a) Tees u(En’), 
(b) Eo, fi fot 
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Ex sem-(fsa) 


Proof. If we write 
CS fa(x) i (| fale) | So, 
Cc 


f(x) = and h,(x) = es LL Face) | SF 


then it is clear that the series 


> einen; Sonni En(*)5 and rn An(x) 


converge at the same points. It follows from Theorem D (applied 
separately to {gn} and {/,}) that )(%2,/n(x) converges a.e. 
if and only if all four series 


(a) vai (ft fad a cuEn!)) 
Co) Daa( [Arde (ff fotu) + eaan(Fad 
 2eu(En!) fac) 


are convergent. It is readily verified that the convergence of (d) 
and (e) (with all choices of the ambiguous signs) is equivalent 
to the convergence of (a), (b), and (c). All that the verification 
requires, in addition to the obvious additions and subtractions, 
is the remark that, since the terms of a convergent series are 
bounded, the termwise product of two convergent series one of 
which has non negative terms is convergent. J 


(1) The following result, which is implicitly contained in our earlier discus- 
sion of the relation between mean convergence and convergence in measure, 
is known as Tchebycheff’s inequality. If / is a measurable function with finite 
variance, then, for every positive number e, 


w(le: [4ls) — fidu| 2 4d) s hor). 


Kolmogoroff’s inequality for » = 1 reduces to Tchebycheff’s. Since, in the 
notation of Theorem A, 


{x: | f(x) | 2 e} os Ute {x: | Die file) | 2 e}, 


an application of Tchebycheff’s inequality separately to each partial sum yields 


u({as [/(e) | 2 €}) Sy Uke b+ Do). 
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(2) An interesting special case of Theorem D is obtained by considering the 
sequence {fn} of Rademacher functions; (cf. 45.5). If {¢n} is a sequence of real 
numbers, then }-n=1¢nfn(*) converges or diverges a.e. according as the series 
DFa1¢n® converges or diverges. In the language of probability: a necessary 
and sufficient condition for the convergence with probability 1 of the series 
Dai + en is the convergence of D’R21 én’, it being understood that + and — 
are equally likely and that the ambiguous signs are determined independently 
of each other. 

(3) The fact that the convergence set of a series of independent functions 
must have measure 0 or | is a consequence of the following very general princi- 
ple, known as the zero-one law. Suppose that the probability space X is the 
Cartesian product of a sequence { X,} of probability spaces. If, for each positive 
integer n, Jn = {2 +1, » +2, ---}, and if a measurable set E in X is a 
J,-cylinder for every n, then n(Z) = 0 or 1. (Hint: write, for every measur- 
able set F, o(F) = n(E NF). If F is a J-cylinder for a finite set J, then 
»(F) = p(E)u(F); since a finite measure on the class of all measurable subsets 
of X is uniquely determined by its values on such cylinders, this relation remains 
valid for E in place of F.) 

(4) If {E£,} is a sequence of independent sets, then u(lim sup, En) = O if 
and only if Done1m(En) < ~; (cf. 9.6). (Hint: let x, be the characteristic func. 
tion of E,, and apply Theorem D to the sequence {xn}.) This result is known 
as the Borel—Cantelli lemma. 

(5) Two sequences { f,} and {g,} are equivalent in the sense of Khintchine if 


Daw wf x: fax)  gn(x)}) < 0. 


If {fn} is a sequence of independent functions, then a necessary and sufficient 
condition for the convergence a.e. of the series )-n=1/n(*) is the existence of an 
equivalent sequence {gn} of independent functions with finite variances such 


that the series (7. fend and )°%_10°(gn) are convergent. 


(6) If {fn} is a sequence of integrable functions and if f is a measurable 
function with finite variance such that, for every positive integer n, the functions 


A, ss hif 7 a +++++ fn) 


are independent, then each f, has finite variance and the series 
Dias Vale) — ffadu) 


converges a.e. (Hint: apply 45.9 and the three series theorem.) 


§ 47. THE LAW OF LARGE NUMBERS 


There are several limit theorems in the theory of probability 
which are collectively known as the law of large numbers; in this 
section we present two typical ones. The first of these is known 
as Bernoulli’s theorem or the weak law of large numbers. 
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Theorem A. If { f,} is a sequence of independent functions 
with finite variances, such that f frdup =0,n = 1,2, +++, and 


ee | 1 

lim, — Dile1 0? (f) = 0, then the sequence {= Dei | of aver- 
n n 

ages converges to 0 in measure. 


Proof. Since o* is homogeneous of degree 2 and, for inde- 
pendent functions, additive, we have 


J (- Sr sf) =o: (- Dif) = 5 Tn ef). 


In other words, the principal assumption of the theorem is equi- 
valent to the assumption that the sequence of averages converges 
to zero in the mean of order two (i.e. converges to zero in the 
space £2), and this implies convergence in measure. 

Two real valued measurable functions f and g on a probability 
space (X,S,u) have the same distribution if u(/—'(M)) = u(g7"(M)) 
for all real Borel sets M. It is easy to verify that if f and g are 
two integrable functions with the same distribution and if F = 


f—'(M) and G = g~(M), for some real Borel set M, then f Jap = 
F 


f gdu. An interesting special case of Bernoulli’s theorem is the 
G 


one in which every two terms of the sequence { /,} have the same 
distribution. In this case o?(f/,) = o7(/,) for every positive integer 


1 : 
n, and hence — 33741 0°(f,) = — 0°(/,), so that the assumption on 
n n 


the asymptotic behavior of {o?(/,,)} is automatically satisfied. 

As auxiliary propositions for the proof of a sharper form of 
the law of large numbers we need the following two results from 
elementary analysis. 


Theorem B. Jf {yn} is a sequence of real numbers which 


converges to a finite limit y, then lima — Dsta1. 9: = Y. 
n 


Proof. Corresponding to every positive number e, there exists 


gone ‘ € 
a positive integer 7 such that ifm > m, then| y, — y| < x Let 
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n, be a positive integer greater than mp and such that 


1 € 
pases no Coos ~e 
n D1 ly: y| < 2 


If n > m, then 


(7 Stay.) -y1 = 1 Dt eI 


1 1 
= [> te (y: — ¥) | + |= Ditenett (ys-y) |< 


n— No 


1 
<= lily — | + <e J 
1 


€ 
2 
Theorem C. Jf { yn} is a sequence of real numbers such that 


: ‘ maw 
the series >.221—yn is convergent, then lim, — y2-1y; = 0. 
n n 


Proof. We write 


1 
so = 0, In = > ae 7°? tn = re Ms a= 1, 2; Boe 
Since y; = i(s; — 5:1), 7 = 1,2, +--+, and 
faa = Sti iss es is;s1= — Soul 53+ (n + 1)Sn4ay 
n=1,2,- 


it follows that 


i a a 


Since the sequence {s,} converges to a finite limit, and since, by 


1 fo8 
Theorem B, the sequence { an si converges to the same limit, 
n 


we have 
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Theorem D. Jf { /,} is a sequence of independent functions 
with finite variances, such that| frdu = 0, n = 1,2, ---, and 


oe 7(fn) 
n=l nt 


almost everywhere. 


1 
< «, then the sequence {= i fl converges to 0 


We remark that the hypothesis and, correspondingly, the con- 
clusion of Theorem D are stronger than those of Theorem A. 
The present theorem is one form of the strong law of large num- 
bers. 


: 1 
Proof. We write g,(x) = 7 fale), n=1, 2, ---, and apply 
46.B to the sequence { gn}. Since f gndu = 0,” = 1,2, ---, and 


o(fn) 


nat 0 ea) = Donat i) < 9, 


it follows that the series 


1 
aS —fn(x) 


n 


converges a.e.; the desired result follows from Theorem C. 


(1) Two measurable functions have the same distribution if and only if they 
have the same distribution function; (cf. 18.11). 

(2) If {o;2} is a sequence of non negative real numbers and if m and n are 
Positive integers such that m <n, then 


Ou ESO neg ts ah pes Omi a oo 
n n 


This inequality can be used to show that the assumptions of Theorem D are 
not weaker than those of Theorem A. That they are properly stronger may be 
shown by constructing a sequence {fa} of independent functions for which 
ary  2tl 
a Ls Sag ery are 1) 
(3) Theorem D is the best possible result of its kind (involving restrictions on 
o*(f,) only) in the following sense: if {on2} is a sequence of non negative real 


a ; : 
numbers such that }\".1—5 = ©, then there exists a sequence {fn} of inde. 
n 


pendent functions such that [fads =0, (fr) = 0,2, n=1, 2, +++, and 
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{t > ft does not converge toOa.e, (Hint: construct f, so that if o,? S n?, 
then ; 
u({x: fal) = n}) = (Ler fale) = —m}) = 55, 
2 
u({etfa(x) = 0}) = 1-55, 
and if o,? > n?, then 


mfx: falx) = ont) = wf: f(x) = —on}) = 4. 


Observe that if lima i191 = 0, then lim, 2 yn = 0, and apply the Borel- 


Cantelli lemma to {x: | f.(x) | 2 2}. 

(4) If {fn} is a sequence of independent functions satisfying the conditions 
of Theorem D, then there exists an equivalent sequence {gn} of independent 
functions such that 


De 1 o*( 89) =e 


n* , 


in other words, the converse of the strong law of large numbers is not true. 
(5) The following weak converse of the strong law of large numbers is true. 
If {fn} is a sequence of independent functions and c is a positive constant, such 


1 .p fl 
that f fadu = 0 and |p fa(*) |Scae,n=1, 2, ---, and if G 2 Ji} con- 
verges to 0 a.e. then P 
.) o ( n. 
Dun-1 (fa) < 

n 

for every positive number «. (Hint: if { yn} is a sequence of real numbers such 
. 1 1 F 

that lim, - >?_ 1.91 = 0, or even such that the sequence \- a x} is bounded, 


then the series >>. an is convergent for every positive number e.) 


(6) The conclusion of Theorem D remains true if the assumption f frau = 0, 


n = 1,2, +++, is replaced by lima” Stes ffidu =0. 


(7) The following is another theorem which is sometimes known as the strong 
law of large numbers. If {fn} is a sequence of independent integrable functions 


with the same distribution, such that f fadu = 0, then lima = Des fi = 0 ae. 

The sequence of assertions below is designed to lead up to a proof of this result. 
1 

(7a) If En = {x: |fi(x) | Sn}, then Tei ff fidu <o. (Hint: let 


P : F 1 
Xn be the characteristic function of En, and write g = Dinwi 5 Xah. If 


k—-1<lAt)|s k, then xn(*) = 0 whenever # < &, and this implies by an 
elementary computation that | g(x) | <2|/i(«)| and hence that g is integrable.) 
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“7b) If Fa = {x: |fa(x)| Sm} and if gn = xr,fn, then the sequence {gn} 
of independent functions is equivalent to {fn}. 

sa, eee : s 

(7c) Tima = OP, f gdu = 0. (int: f edie f. Sd = f, Sidu and {Ei} is 

an increasing sequence of measurable sets whose union is X; (cf. Theorem B).) 
o | ; 

(7d) Soren 737 (En) <o, (Hint: observe that f gn’du =f, fedu =f, fide 

and apply (7a). This establishes the convergence of 21 a f Sn'du; the con- 


1 2 
vergence of one 52 ( fi suds) follows from the relation 


(Sent) s (flisle)) 


(8) The following converse of the version of the strong law of large numbers 
stated in (7) is true. If {f,} is a sequence of independent functions with the 


ee ae | er 
same distribution such that lima = Dt ifi = 0 ae., then f, is integrable. 


(Hint: the relation lim, 1 Jn = 0 a.e., together with the Borel-Cantelli lemma, 


implies the convergence of the series >-2-1 u({x: | fn(x) | > m}). Observe that 
w(Lx: | fa(x) | > 2}) = w({x: | f(x) | > 2}) and apply 27.4.) 

(9) Applying the strong law of large numbers to the Rademacher functions 
we obtain the celebrated theorem of Borel on normal numbers: almost every 
number in the unit interval has in its binary expansion an equal number of 0’s 
and 1’s. Similar considerations are valid with respect to any other radix r in 
place of 2 (r = 3), and yield the theorem concerning absolutely normal numbers: 
almost every number is normal with respect to every radix simultaneously. 


§ 48. CONDITIONAL PROBABILITIES AND EXPECTATIONS 


If E and F are measurable subsets of a probability space 
(X,S,u) such that u(F) #0, we have defined the conditional 
probability of Z given F by the equation 


ur(EZ) = wW(E Nn F)/u(F) 


(cf. §44 and 45.1), and we have investigated slightly its de- 
pendence on E. We are now interested in the way in which ur(Z) 
depends on F. If Fis such that both u(F’) and u(F”) are different 
from 0, we introduce a measurable space Y consisting of exactly 
two points yi and ye (with the understanding that every subset 
of Y is measurable), and a measurable transformation T from X 
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into Y, defined by T(x) = y1 or ye according asxeF orxeF’. 
If for every subset 4 of Y we write 


ve(4) = w(EN T“(A)) and »(4) = vx(4) = w(T(A)), 


then, clearly, 


ur(E) = a(t ys) and yp (E) = vet ¥2}) 


v({ y1}) v({ yo}) 


In other words conditional probability may be viewed as a measur- 
able function on Y—that function which is, roughly speaking, the 
ratio of the two measures vg and ». 

Generalizing the considerations of the preceding paragraph, we 
may consider a finite, disjoint class {Fi, ---, F,} of measurable 
sets of positive measure such that (?_, F; = X, and correspond- 
ingly we may introduce a measurable space Y of 7 points y, 
--+, yn. If T(x) = y; whenever x e F;, i = 1, +--+, 2, then Tis a 
measurable transformation from X into Y, and once more we may 
represent conditional probabilities as ratios of two measures on 
Y. These considerations motivate the following general defini- 
tion. If Tis any measurable transformation from the probability 
space (X,S,u) into a measurable space (Y,T), and if we write 
ve(F) = w(E N T-'(F)) whenever E and F are measurable sub- 
sets of X and Y respectively, then it is clear that yg and wT 

= vx) are measures on T such that vg <uT. It follows from 
the Radon—Nikodym theorem that there exists an integrable func- 
tion pz on Y such that 


WEN TF) = S pe(y)duT—(y) 


for every F in T; the function pz is uniquely determined modulo 
pT, We shall call pz(y) the conditional probability of E given 
y or the conditional probability of Z given that T(x) = y. Some- 
times we shall use the phrase ‘the conditional probability of E 
for a given value of T(x)” to refer to the number pz(T(x)). We 
shall generally write p(Z,y) for pz(y); on the occasions when it 
is necessary to consider p as a function of its first argument we 
shall write p¥(Z) = p(EZ,y). 
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If F is such that n(T7(F)) # 0, we may divide the equation 
which defines p by u(T~!(F)) and obtain the relation 


“EN THF) : 
TY EY SPM) 
Since the extreme left term of this relation is the conditional proba. 
bility of E given T-(F), it is formally plausible that as “F 
shrinks to y,” the left term should tend to the conditional proba- 
bility of Z given y and the right term should tend to the integrand 
p(E,y). The use of the Radon—Nikodym theorem is a rigorous 
substitute for this rather shaky “‘difference quotient” approach. 


Theorem A. For each fixed measurable set E in X, 
0 < p(Ey) $1 WT; 


LT-\(F) (E) = 


for each fixed disjoint sequence {E,} of measurable sets in X, 


P(Ur-1 Ensy) Se yak p(E,,y) {uT~']. 


Proof. The inequality is an immediate consequence of the 
fact thatO S$ un(E N T"(F)) S u(Z) for every measurable subset 
Fof Y. To prove the equation, observe that 


S oUt: Emy)duT“(y) = w((Uter Ex) N T-(F)) = 
25a RIA) 2 i p(Eny)duT-(y) = 


= f Lr PEny))deT 70), 


and apply the uniqueness assertion of the Radon—Nikodym 
theorem. J 

Theorem A asserts that p” behaves in certain respects like a 
measure. It is easy to obtain more evidence in this direction 
and to prove, for instance, that p(X,y) = 1 [u7~], that if EZ, c Ep, 
then p(E,,y) S p(Eo,y) [uT—1], and that if {E,} is a decreasing 


sequence of measurable sets in X, then 


2(Nrer En,y) = lim, P(Eny) {uT~]. 
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It is important, however, to remember that the exceptional sets 
of measure zero depend in each of these cases on the particular 
sets E; under consideration, and it is in general incorrect to con- 
clude that p” is a measure for almost all values of y. 

The defining equation of p(E,y) may also be written in the form 


= —1 - i 
Fogg xe auto) J e@odaer (y). If, more generally, f is 


any integrable function on X, then we may consider its indefinite 
integral », defined by 


oF) =f, fleddu(e), 


for all measurable sets F in Y, as a signed measure on T. Since 
clearly » <yuT™, it follows from the Radon—Nikodym theorem 
that there exists an integrable function e; on Y such that 


force f@ 4a) = fe rdduT-(y) 
TP) P 


for every F in T; the function e is uniquely determined modulo 
uT~!, We shall call e(y) the conditional expectation of / given 
y; we shall also write e(f,y) instead of e;(y). 

Since the relation between p and ¢ is similar to the relation 
between a measure and an indefinite integral, it might seem that 
some such equation as 


ey) = freeddpn(x) 


ought to hold. Since, however, p¥ is not in general a measure, 
the right term of this equation is undefined; the misbehavior ot 
» is reflected, slightly enlarged, in the misbehavior of e. 


Theorem B. If f is an integrable function on Y, then fT 
is an integrable function on X and e(fT,y) = f(y) [uT~'J. 


Proof. It follows from 39.C that fT is integrable and that 
fi f(TODdu(x) = f fo)duTA(y) for every Fin T. | 

T"\"(F) F 

(1) Suppose that (X,S,u) and (Y,T,v) are probability spaces and consider 


their Cartesian product (X X Y,S X T,u Xv). If T(x,y) = +, then T is a 
measurable transformation from X X Y onto X. For every measurable set E 
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in X X Y, p(E,x) = v(E;) [uy], and hence, in this case, pz may indeed be defined 
for each E so that p* is a measure for every x. 
(2) Suppose that (X,S,u) and (Y,T,v) are probability spaces and let be a 


probability measure on S X T such that A <u X », say A(Z) =f fau Xx v). 
If T(x,y) = x, then, for every measurable set EZ in X X Y, 


P(E») = fel rYle.y)r(9) lab 


(3) If T is a measurable transformation from a probability space (X,S,u) 
into a measurable space (Y,T), then p(T-'(F),y) = xr(y) [uT~] for every 
measurable set F in Y. 

(4) The purpose of the following considerations is the construction of an exam- 
ple for which the conditional probabilities p(Z,y) cannot be determined so that 
p” is a measure for almost every y. Let Y be the closed unit interval, let T 
be the class of all Borel subsets of Y, and let vy be Lebesgue measure on T. Write 
X = Y and let S be the o-ring generated by T and a set M such that both M@ 
and M’ are thick in Y. A probability measure » is unambiguously defined on S 


by writing 
u((4 NM) U (BN M) = (4) 


whenever 4 and B are in T; we consider the transformation T from X onto Y 
defined by T(x) = x. Suppose that there exists a set Co of measure zero in T 
such that p¥ is a measure on S whenever y e’ Cp. 

(4a) If Do = { y: p(M,y) 1}, then »(Do) = 0. 

(4b) If Eo is the set of those points y for which it is not true that p(T-!(F),y) 
= xr(¥y) identically for all Fin T, then y(Ey) = 0. (Hint: let R be a countable 
ring such that S(R) = T. If, for each FinR, 


Eo(F) = {y: p(T-(F), 9) ¥ x(a}, 


then v(E)(F)) = 0. Make use of the fact that if two probability measures 
agree on R, then they agree on T also.) 

(4c) If ye’ Cy U Do U Ep, then y e M. (Hint: the relations p(M,y) = 1 and 
p(T ({_¥}),y) = 1, together with the fact that pY is a measure, imply that 


p(M 1 T-\(fy}),y) = 1.) 


Since (4c) implies that the Borel set Co’ M Do’ M Eo’ of measure 1 is con- 
tained in the set M, we have derived a contradiction with the assumption that 
M’ is thick. 

(5) If X is the real line and yw is a probability measure on the class S of all 
Borel sets in X, and if T is a measurable transformation from X into a measur- 
able space (Y,T), then the conditional probabilities p(Z,y) may be determined 
so that p¥ is a measure for almost every y. (Hint: write g(x,7) = p((—%,x),y). 
There exists a measurable set Co in Y such that ¢T—'(Cy) = 0 and such that if 
y ©’ Co, then g¥ is a monotone function on the set of all rational numbers in X 


: 1 : 
and, moreover, limn g¥ (« - -) = q(x) for every rational number x, Let q# 


be a left continuous monotone function on X which agrees with g¥ for rational 
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values of x and let 5” be the measure on S determined by the conditions 
BY((—~%,x)) = G(x); write B(Z,y) = BY(E).) 

(6) If T is a measurable transformation from a probability space (X,S,u) 
into a measurable space (Y,T), and if it is possible to determine the conditional 
probabilities p(E,y) so that p¥ is a measure for almost every y, then 


ey) = [f(e)dem(x) WT] 


for every integrable function f on X. (Hint: the relation is true if f is the char- 
acteristic function of a measurable set.) 

(7) If Tis a measurable transformation from a probability space (X,S,u) into 
a measurable space (Y,T), and if f and g are integrable functions with respect 
toy and wT respectively, such that the function 4 defined by A(x) = /(x)g(T(x)) 
is integrable on X, then 


e(A,y) = e(hy)e(y) (HT. 


§ 49. MEASURES ON PRODUCT SPACES 


Does there exist a sequence of independent random variables 
with prescribed distributions? More precisely, if {u,} is a se- 
quence of probability measures on the Borel sets of the real line, 
does there exist a probability space (X,S,u) and a sequence { fn} 
of independent functions on X such that p(fn'(E)) = un(E) 
for every Borel set E and every positive integer 2? More gen- 
erally, if {(Xn,Sn,un)} is a sequence of probability spaces, does 
there exist a probability space (X,S,u) and, for each positive 
integer m, a measurable transformation T, from X into 
XX X+++X& Xp, such that wT,7) = ws X+++X wa? The affirma- 
tive answers to these questions are given by 38.B. 

It is important for the purposes of probability theory to intro- 
duce the concept of independence, and, at the same time, to 
emphasize that it is not the general case. The main purpose of 
this section is to formulate and prove a theorem which does for 
dependent random variables what 38.B did for independent ones— 
a theorem, in other words, which asserts that there always exists 
a sequence of random variables with prescribed joint distributions. 
Unlike 38.B, however, the theorem of this section will apply to 
the case of uniformly bounded, real valued functions only; in 
other words, the components of the product space which we shall 
treat are all unit intervals. The result and its proof extend to 
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more general cases, which, however, all have in common the fact’ 
that they depend on topological concepts. This peculiar and 
somewhat undesirable circumstance appears to be unavoidable; 
it is known that the general measure theoretic analog of Theorem 
A below is not true. 

Suppose that, for each positive integer 7, X,, is the closed unit 
interval and S, is the class of all Borel sets in X,, and write 
(X,S) = Krai (Xn,Sn). Let Fx be the o-ring of all measurable 
{1, «++, m}-cylinders in X and let F (= Uti Fn) be the ring of 
all measurable, finite dimensional subsets of X; (cf. § 38). 


Theorem A. Jf p is a set function on F such that, for each 
positive integer n, 41s a probability measure on F,, then u has a 
unique extension to a probability measure on S. 

Proof. We define a measurable transformation 7,,, from X onto 
the measurable space Y, = X7?21 Xi, by 


T(x, Thy Xny Xn415 a = (x1, paar: Xa)s i= 1, 2; mh, 


and we write, for every measurable subset 4 of Y,, vn(4) = 
u(T,~'(A)). If {E,} is a decreasing sequence of sets in F such 
that 0 < e S$ w(E,), i = 1, 2, ---, then, for each fixed i, there is 
a positive integer 7 and a Borel set 4; in Y, such that E; = 
Tn '(4;). Let B; be a closed subset of 4; such that »n(4; — B;) 


< a . If F; = T,71(B,), then F; is a compact subset of the 


product space X (in its product topology) and u(Z; — F;) S o 


If G, = ()f-; Fi, then {G,} is a decreasing sequence of compact 
subsets of X. Since 


w(Ey -— Gr) = w(Uiei (Ee — Fi) S u(UE (Ai -— F)) S 
it follows that 
m(G:) = (E,) ~ Ex - Ge) 25, 


€ 
2? 


and hence that G, # 0,k = 1,2, ---. Since a decreasing sequence 
of non empty compact sets has a non empty intersection, it follows 
that mw is continuous from above at 0 and hence countably addi- 
tive; the desired result follows from 13.A. J 
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Retaining the notation established above we proceed to the 
proof of an interesting property of product spaces of the type 
discussed in Theorem A. 


Theorem B. For every measurable set E in X, 
lim, p(E,Tn(~)) = xe(x) [ols 


in other words, the conditional probabilities of E, for given 
values of the first n coordinates of a point x, converge (except 
perhaps on a set of x’s of measure zero) to 0 or 1 according 
asxeEorxe’ E, 


Proof. It is convenient to prove almost uniform convergence 
instead of almost everywhere convergence—it follows, of course, 
from 21.A and 21.B that the two are equivalent. Let ¢ and 6 
be any two positive numbers and suppose that <1. By 13.D 
there exists a positive integer 7 and a measurable {1, ---, mo}- 


3 
cylinder Ey such that p(EA E&) < ae We write B= EA Ey 
and we observe that if x e’ B, then 
xe(x) = Xx,(*). 


If Cr a {xs ?(B,T,(*)) 2 5}, Dd, =, C, = Wize C:, n= 1s 2, 
---> and C = UR: Ca = Une: Dn, then, for each 2, C, and D, 
are measurable {1, ---, m}-cylinders. It follows that 


u(BN D,) = fi P(B,Ta(x))du(x) = bu(D,), 


and hence that 


o> u(B) = w(B NC) = WBN UsrD,) = 


Dore e(B N D,) = 8 58-1 w(D,) = 
bu(Lra1 Dn) = d(C). 


6 
If we write 4 = B UC, then u(4) S a <e. Since 


| P(E, Tn(x)) ~ p(Eo,Ta(x)) | S P(E A Fo,Tn(x)) (ul, 


m=1,2,-%, 
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we may assume that these relations are valid for every x in X. 
If n = no, then it follows from 38.A and 48.B that 


| p(E,Tn(*)) — xe() | S p(B,T.(x)). 


If, in addition, x e’ 4, then, in the first place xz,(x) = xz(x) and, 
in the second place p(B,Tn(«)) < 4, so that | p(Z,7,(*)) — xe(x) | 
<6 § 


(1) Suppose that {(Xy,S.,Hn)} is a sequence of probability spaces, (X,S) = 
Xin1 (XnSn), and u is a set function on F such that, for each positive integer 7, 
is a probability measure on F,. If, on each Fy, u is absolutely continuous with 
respect to the product measure Xfa1p:, then » has a unique extension to a 
probability measure on S. (Hint: cf. the proof of 38.B.) The result and the 
method of proof extend to all cases in which the conditional probabilities 
pP(E,T,(x)) may be determined so that for almost every fixed x they define 
probability measures on each Fy. 

(2) The statement and the proof of Theorem A remain correct if the spaces 
Xn, ate compact metric spaces. It follows, by a trivial compactification, that 
Theorem A is true if each Xq is the real line. Does it remain true for arbitrary 
compact spaces? 

(3) Retaining the notation of (1), we proceed to give an example to show that 
Theorem A is not necessarily true if the spaces Xn are not intervals. Let Y 
be the unit interval, T the class of all Borel subsets of Y, and » Lebesgue measure 
on T. Let {X,} be a decreasing sequence of thick subsets of Y such that 
Nei X_ =0. WriteS, = TN X,; if EeS,, so that E = FN X, with F 
in T, then write za(E) = v(F). Form the product space (X,S) = Krai (Xn,Sn); 
and, for each positive integer 7, let S, be the measurable transformation from 
X, into X; X---& Xn defined by Sa(xn) = (21, -+*, Zn), 22 = Xn? = 1, +++ m 

(3a) For each measurable {1, ---, }-cylinder E in X, 


(E = AX Xana X Xanga K++, AeSi X-°& Sy), 


write u(E) = pn(Sn~'(4)). The set function » is thereby unambiguously de- 
fined on F and, for each fixed positive integer m, u is a probability measure on F,. 

(3b) If E; is the set of all those points (x1, x2, --+) in X whose first i coordi- 
nates are all equal to each other, # = 1,2, ---, then Z;eF; (Hint: if 


D; = {(91, 0+ Ht =o = Dh, 


then D; is a measurable subset of the ‘dimensional Cartesian product of Y 
with itself, and 


E; = (Di 1 (X1 X+++& X)) XK Xa K Xige Kee) 


(3c) The set function pz on F is not continuous from above at 0. (Hint: con- 
sider the sets E,, ¢ = 1, 2, ---, defined in Gb), and observe that u(Z,) = 1 
and aber E; = 0.) 

(4) The zero—one law (46.3) is a special case of Theorem B. Indeed if EZ is a 
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J,-cylinder and if F is a measurable subset of Y,, then Tn~1(F) is a {1, ---, n}~ 
cylinder and 


wE NT.) = WET.) = f u(E)dt, 


and therefore p(E,T,(x)) is a constant (= u(E)) almost everywhere [uJ]. It 
follows from Theorem B that x2(x) = u(Z) [x], and hence that u(Z) is either 
O or 1. 


Chapter X 


LOCALLY COMPACT SPACES 


§ 50. TOPOLOGICAL LEMMAS 


In this section we shall derive a few auxiliary topological 
results which, because of their special nature, are usually not dis- 
cussed in topology books. 

Throughout this chapter, unless in a special context we explicitly 
say otherwise, we shall assume that X is a locally compact 
Hausdorff space. We shall use the symbol ¢ for the class of all 
real valued, continuous functions f on X such that 0 S f(x) S 1 
for all x in X. 


Theorem A. /f C is a compact set and U and V are open 
sets such that CC U U PV, then there exist compact sets D and 
E such that DCU, ECV,andC=DUE. 


Proof. Since C — U and C — F are disjoint compact sets, 
there exist two disjoint open sets 0 and 7 such that C—- Uc 0 
and C-V CP; we write D=C—O and E=C-P. It 
is easy to verify that D Cc U, E c PV, and that D and E are com- 
pact; since 0 N 7 = 0, we have DU E = (C— 0) U (C— P) 
=C-—-(O0nNPA=C. | 


Theorem B. Jf C is a compact set, F is a closed set, and 
C1 F = 0, then there exists a function f in & such that f(x) = 0 
for x in C and f(x) = 1 for x in F. 


Proof. Since X is completely regular, corresponding to each 
point y in C there exists a function f, in § such that f,(y) = 0 and 


f,(«) =1 for x in F. Since the class of all sets of the form 
216 
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{x: f,(x) < 4}, y in C, is an open covering of C, and since C is 
compact, there exists a finite subset { y,, ---, yn} of C such that 


Co Uh {«:4,.(%) < ¥}. 


If we write g(x) = []i-1/,,(), then g eS; since OS f(x) S 1 
for all x in X and all y in C, it follows that g(x) < 4 for x in C 
and g(x) = 1 forxin F. It is easy to verify that if f = (2g — 1) 
U 0, then f e §, f(x) = 0 for x in C, and f(x) = 1 forxin FF. §j 

It is sometimes relevant to know not only whether or not a 
function f (in &) can be found which vanishes on C, as in Theorem 
B, but also whether or not it may be chosen so as not to vanish 
anywhere else. The answer is in general negative; the following 
theorem contains some of the pertinent details. 


Theorem C. /f f is a real valued continuous function on 

X and ¢ is a real number, then each of the three sets 

{xi f(x) 2 ch, {xs fe) Sch, and {x: f(x) = ¢} 

is a closed Gs. If, conversely, C is a compact G3, then there 

exists a function f in & such that C = {x: f(x) = 0}. 

Proof. Since {x: f(x) 2c} = {x: —f(~) S —c} and since 
{x: f(x) = c} = fx: f(x) 2c} N {x: f(x) Sc}, it is sufficient to 
consider the set {x: f(x) Sc}. The fact that this set is closed 

1 
(and that, for every » = 1, 2, ---, the set {xs AG) <et+ -} 


is open) follows from the continuity of f; the fact that it is a 
Gs is shown by the relation 


{xi f(x) Sc} = Per {fl <et+ “|. 
n 


Suppose, conversely, that C = (}%.1 Un, where C is compact 
and {U,} is a sequence of open sets. For every » = 1, 2, ---, 
there exists a function f, in § (Theorem B) such that f,(x) = 0 
for x in C and f,(x) = 1 for x in X — U,. If we write f(x) = 

1 
nat dafn(*)s then fe and f(x) = 0 tor « in C. For any 


xin X — C there exists at least one positive integer ” for which 
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xeX — U,; it follows that, for x in X — C, f(x) = 5a fale) = 


1 
oa > 0, and therefore that C = {x: f(x) = 0}. J 

Theorem D. If C is compact, U is open, and Cc U, then 
there exist sets Cy and Uy such that Cy is a compact Gs, Up is 
@ o-compact open set, and 


Cc UcGcU.z. 


Proof. Since there exists a bounded open set V such that 
CcV CU, there is no loss of generality in assuming that U 
is bounded. Let f be a function in § such that f(x) = 0 for x 
in C and f(x) = 1 for x in X — U, (Theorem B); write 


Uo = {x: f(x) < 3} and Cy = {x:f(x) S }}. 


Clearly CC Up CCy Cc U and, by Theorem C, Cy is a closed 
G;. The fact that Cy is compact follows from the boundedness of 
U; the fact that U) is o-compact is shown by the relation 


1.4 
Uo = Ure [=f@) = a =i: | 
Theorem E. Jf X is separable, then every compact subset 
C of X is aGs. 


Proof. If a point x of X is not in C, then there exist two 
disjoint open sets U(x) and V(x) such that C C U(x) and x ¢ V(x). 
Since X is separable and since the class { V(x): x e’ C} is an open 
covering of X — C, there exists a sequence {x,} of points in X 
such that 

X-CcULAY Os). 
It follows that 


Niter U(%n) DCD Aner (X — Ven) D Near Ulen). 


(1) An alternative proof of Theorem B may be given by introducing the one- 
point compactification of X and using the known fact that every compact 
Hausdorff space is normal, and that therefore if C and D are two disjoint closed 
subsets of a compact Hausdorff space, then there exists a function f in § such 
that f(x) = 0 for x in C and f(x) = 1 for ¥ in D. 
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(2) Theorem C may be applied to prove the result, which is also easy to prove 
directly, that the class of all compact Gs’s is closed under the formation of finite 
unions and countable intersections. 

(3) If X* is the one-point compactification, by x*, of an uncountable discrete 
space X, then the one-point set {x*} is a compact set which is not a Gs. 

(4) Let J be an arbitrary uncountable set; for each i in J, let X; be the (com- 
pact Hausdorff) space consisting of the two real numbers 0 and 1, and let X 
be the Cartesian product Ki Xi. 

(4a) Every one-point set in X is a compact set which is not a G3. 

(4b) We call a subset E of X an No-set if there exists a countable set J in J 
such that E is a J—cylinder; (cf. 38.2). A compact set Cin X is a Gs if and only 
if it ig an No-set. (Hint: if C is compact, U is open, and C C U, then, by the 
definition of topology in X, there exist a finite subset J of J and an open set Up 
which is a J—cylinder such that CC U C UV.) 

(4c) If f is any real valued continuous function on X and M is any Borel set 
on the real line, then f~!(M) is an No~set. 

(5) Let X* and Y* be the one-point compactifications (by «* and y*) of a 
countably infinite and an uncountable discrete space, respectively. The subsets 


({x*} ve fa {(x*,y*)} and (X* X {y¥*}) = {(x*,7*)} 


of the locally compact Hausdorff space (X* X Y*) — {(x*,y*)} may be used to 
show that Theorem B is false if C is not required to be compact. 
(6) The class of all c—-compact open sets is a base; (cf. Theorem D). 


§ 51. BOREL SETS AND BAIRE SETS 


The relations between measurability and continuity are most 
interesting, and have been studied most, in locally compact 
spaces. We continue with our study of a fixed locally compact 
Hausdorff space X; in the present section we shall introduce the 
basic concepts and results of a theory of measurability in X. 

We shall denote by C the class of all compact subsets of X, 
by S the o-ring generated by C, and by U the class of all open sets 
belonging to S. We shall call the sets of S the Borel sets of X, 
so that, for instance, U may be described as the class of all open 
Borel sets. A real valued function on X is Borel measurable 
(or simply a Borel function) if it is measurable with respect to 
the o-ring S. 


Theorem A. Every Borel set is o-bounded; every o—-bounded 
open set is a Borel set. 


Proof. Every compact set is trivially bounded and therefore 
o—bounded. The class of all o—bounded sets is a o-ring; since this 
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o-ring includes C, it contains every set of the o-ring generated 
by C. 

Suppose, conversely, that U is open and that {C,} is a sequence 
of compact sets such that 


Uc Us-1 Cn = K. 
Since, for m = 1, 2, ---, C, — U is compact, it follows that 
D = Ure (Ci, — U) & 8; 


since D = K — U, it follows that U = K—(K—U)eS. J§ 

We shall denote by Co the class of all those compact subsets 
of X which are G;’s, by So the o-ring generated by Co, and by Up 
the class of all open sets belonging to Sp. We shall call the sets 
of So the Baire sets of X,so that, for instance, Uy may be de- 
scribed as the class of all open Baire sets. A real valued function 
on X is Baire measurable (or simply a Baire function) if it is 
measurable with respect to the o-ring Sp. 

On first glance it might appear that the Borel sets are the 
obvious objects of measure theoretic investigation in locally 
compact spaces. There are, however, several natural reasons 
for the introduction of the apparently artificial concept of Baire 
set. First: the theory of Baire sets is in some respects simpler 
than the theory of Borel sets, and knowledge about Baire sets 
frequently provides a successful tool for dealing with Borel sets; 
(cf. § 63). Second: the study of Baire sets is connected with the 
reasonable requirement that the concept of measurability in X 
should be so defined as to ensure that every continuous function 
(or at least every continuous function which vanishes outside 
some compact set) is measurable; (cf. Theorem B below). Third: 
the class of all Baire sets plays a distinguished role, in that it is 
the minimal o-ring which contains sufficiently many sets to de- 
scribe the topology of X; (cf. Theorem C below). Fourth: in 
all classical special cases of the theory of measure in topological 
spaces (e.g. in Euclidean spaces) the concepts of Borel set and 
Baire set coalesce; (cf. 50.E). 


Theorem B. [Jf a real valued, continuous function f on X is 
such that the set N(f) = {x: f(x) ¥ 0} 15 o-bounded, then f is 
Baire measurable, 
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Proof. If ac—-bounded open set U is an F,, then there exists a 
sequence {C,} of compact sets such that U= U721C,. By 
50.D, for each positive integer 7 there exists a compact Baire 
set D, such that C, CD, c U. It follows that U = U7.,D 
and hence that U is a Baire set. The assumptions on f anply 
that, for every real number c, the set N(f) N {x: f(x) <c} isa 
o—bounded open set which is an F,. Jj 


Theorem C. [f B is a subbase and if § is a o-ring contain- 
ing B, then S D So. 


Proof. If Cis a compact set and U is an open set containing 
C, then there exists a set E which is a finite union of finite inter- 
sections of sets of B (and which therefore belongs to $) such that 
CcEcU. Hence, if C = (\%21 Un, where each U, is open, 
then, for every 7 = 1, 2, ---, there exists a set E, in S such that 
C CE, C Un; it follows that C = ()%-, E, €S. Since we have 
thus proved that Cy) cS, the desired result follows from the 
definition of Sy. J 

The class of Baire sets was defined to be the o-ring generated 
by the class of compact G;’s; it appears conceivable (though upon 
reflection somewhat improbable) that a compact set may be a 
Baire set without being a G;, i.e. that compact sets other than the 
generating ones manage to get intoSo. The purpose of the follow- 
ing theorem is to show that this does not happen. 


Theorem D. Every compact Baire set is a G,. 


Proof. Let C be a compact set in So; by 5.D, there exists a 
sequence {C,} of sets in Co such that C belongs to the o-ring 
S({C,}). By 50.C, for every n = 1, 2, ---, there exists a func- 
tion f, in § such that C, = {x: fn(~) = 0}. If for each pair, x 
and y, of points in X we write 


d(x,v) = ae | fa(x) — fay) | 


then d(x,x) = 0, d(x,y) = d(y,x), and 0S d(x,y) S d(x,z) + 
d(z,y). It follows that if we write x = y whenever d(x,y) = 0, then 
the relation “=” is reflexive, symmetric, and transitive, and 
therefore an aguivalenve relation; we denote the set of all equiv- 
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alence classes by =. For every x in X we write = T(x) for the 
(uniquely determined) equivalence class which contains «. 

If T(x.) = T(y1) and T(x2) = T(yz) (1c. if x1 = y1 and x2 = yo), 
then 


d(x1,x%2) S d(x1,91) + d(yi,¥2) + d( y25X2) = d(y15¥2)5 


and, by symmetry, d(y1,y2) S d(x1,%2), so that d(%1,%2) = d(y1,y2). 
This means that if & = T(x,) and & = T(x) are two elements of 
&, then the equation 6(&,&) = d(x,,x2) unambiguously defines 
the number 6(&,é&). Since 6(£,,) = 0 implies that & = &, 
the function 6 is a metric on Z. If & = T(x) is any point of the 
metric space &, if 7p is any positive number, and if E = {&: 6(&,&) 
< ro}, then T(E) = {x: d(xo,x) < ro}; since d(xo,x) depends 
continuously on x, this proves that T is a continuous transforma- 
tion from X onto &. 

A subset of X is the inverse image (under T) of a subset of & 
if and only if it has the property that it contains, along with any 
of its points, all points equivalent to that one (i.e. if and only 
if it is a union of equivalence classes). Since each C,, has this 
property, since the class of all inverse image sets is a o-ring, and 
since Ce S({C,}), it follows that there exists a subset I of = 
with T71(T) = C. Since T(T—(T)) = T, since T is continuous, 
and since C is compact, it follows that is compact. Since every 
closed (and therefore every compact) subset of a metric space is 
a Gs, there exists a sequence {A,} of open subsets of = with 


T 7 a ees An: 


If we write U, = T“(A,), n = 1, 2, ---, then C = ()7_,U,; 
since, by the continuity of T, U,, is open, it follows that CeCy. 


Theorem E. Jf X and Y are locally compact Hausdorff 
spaces, and if Ao, Bo, and So are the o-rings of Baire sets in 
X, Y, and X X Y respectively, then So = Ao X Bo. 


Proof. If 4 and B are compact Baire sets in X and Y respec- 
tively, then 4 X B is a compact G;, and hence a compact Baire 
set in X X Y. Since Ay X Bo is the o-ring generated by the class 
of all sets of the form 4 X B, it follows that Ap X By C Sp. 
If U and V are open Baire sets in X and Y respectively, then 
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U X V ce Ay X Bo. Since the class of all sets of the form U X V 
is a base for X X Y, it follows from Theorem C that Ap X By D 
So. I 


We conclude this section by stating, for the purpose of refer- 
ence, an easily verified theorem related to the generation of Borel 


sets and Baire sets; (cf. 5.2 and 5.3). 


Theorem F. The class of all finite, disjoint unions of proper 
differences of sets of C [or of Co] is a ring; the o-ring it generates 
coincides with § [or, respectively, with So}. 


(1) The definition of Borel set for the real line, when it is considered as a 
locally compact space, is equivalent with the definition in § 15. 

(2) The entire space X is a Borel set if and only if it is o~compact. 

(3) The o-ring generated by the class of all bounded open sets, or, equivalently, 
the o-ring generated by U, coincides with S. (Hint: for every compact set C, 
let U be a bounded open set containing C, and consider U ~— (U — C).) 

(4) If X is the product space of 50.4, then the class of Baire sets coincides 
with the class of measurable sets, as defined in § 38. 

(5) The o-ring generated by the class of all bounded open Baire sets, or, 
equivalently, the o-ring generated by Uo, coincides with So. (Hint: if C is 
compact, U is open, and CC U, then there exists a bounded open Baire set 
Uy such that CC Up C U.) 

(6) The term “Baire set” is suggested by the term ‘“‘Baire function” as used 
in analysis. If @ is the smallest class of functions which contains all continuous 
functions and contains the limit of every pointwise (but not necessarily uni- 
formly) convergent sequence of functions in it, then the functions of @ are called 
the Baire functions on X. A necessary and sufficient condition that a set be a 
Baire set is that it be a Borel set and that its characteristic function be a Baire 
function. 

(7) Every Boolean g—algebra is isomorphic to the class of all Baire sets, 
modulo Baire sets of the first category, in a totally disconnected, compact 
Hausdorff space. (Hint: cf. 40.15c and observe that the o-ring generated by 
the class of all open-closed sets in a totally disconnected, compact Hausdorff 
space coincides with the class of all Baire sets.) 


§ 52. REGULAR MEASURES 


A Borel measure is a measure y» defined on the class S of all 
Borel sets and such that u(C) < for every C in C; a Baire 
measure is a measure po defined on the class Sp of all Baire sets 
and such that po(Co) < © for every Co in Co. 

Several aspects of the theories of Borel measures and Baire 
measures are so similar to each other that it is worth while to 
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develop them simultaneously; for this purpose we adopt the fol- 
lowing notational device. Throughout this section we shall use 
¢, 0, and § to stand either for C, U, and S or else for Co, Uo, 
and So, respectively, and we shall study a measure f which is a 
Borel measure if 8 = S and a Baire measure if S = Sp. 

A set E in § is outer regular (with respect to the measure Q) if 


A(E) = inf {a(U): Ec Ue}; 
a set E in § is inner regular (with respect to f) if 
A(E) = sup {a(C): ED C eC}. 


A set E in § is regular if it is both inner regular and outer regular; 
a measure ff is regular if every set EZ in § is regular. 

Loosely speaking, a measure is regular if all its values may be 
calculated from its values on the topologically important com- 
pact sets and open sets; if it is desired that the measure theoretic 
structure of X be not completely unrelated to its topological 
structure, the condition of regularity is a natural one to impose. 
The measure theoretic behavior of a non regular set is very 
pathological. 

It is easy to verify that if E eS and A(E) = », or if Ee DB, 
or if E is the intersection of a sequence of sets of finite measure 
in U, then E is outer regular. Dually, if EeS and @(E) = 0, 
or if Ee 6, or if E is the union of a sequence of sets in C, then 
E is inner regular. Our first purpose in the sequel is to show that 
the regularity of certain sets implies the regularity of many others. 
The motivation of the particular steps in the proof is furnished 
by 51.F; we progress from compact sets to their differences, and 
from differences to unions of differences. After that we shall show 
that the class of regular sets has sufficient closure properties to 
justify the application of the theorem on the monotone class 
generated by a ring, and thus we shall obtain the conclusion that 
certain measures are necessarily regular. 


Theorem A. If every set in © is outer regular, then so is 
every proper difference of two sets of C; if every bounded set in 
C is inner regular, then so is every proper difference of two sets 
of C. 
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Proof. Let C and D be two sets in € such that CoD. If 
C is outer regular, then, for every « > 0, there is a set Uin 0 such 
that CCU and a(U) Sa(C) +. Since C-DCU-—DeO, 


the relations 


a(U — D) — a(c — D) 


A((U — D) — (C — D)) = 
A(U — C) = aU) - a(C) S 


imply that C — D is outer regular. 

To prove the assertion concerning inner regularity, let U be a 
bounded set in 0 such that Cc U. If the bounded set U — D 
(in 0) is inner regular, then, for every ¢ > 0, there is a set E 
in € such that Ec U — D and a(U —D)S a(E) +. Since 
C-D=CN(U-D)ACN Ec, the relations 

atc — D) - aC N FE) = a((C-—D) - (CN £)) = 
=a(C~D)-£)s 
S$ A((U-D)—- £)= 
= #(U —D)— aE) Se 
imply that C — Disinner regular. Jf 


Theorem B. 4 finite, disjoint union of inner regular sets of 
finite measure is inner regular. 


Proof. If {Fi, ---, E,} is a finite, disjoint class of inner reg- 
ular sets of finite measure, then, for every « > 0 and for every 
i = 1, ---, 2, there exists a set C; in C such that 


C;C E; and (EZ) S$ a(C) + = 
If C= ULC; and E= U21E, then E>CeC, and the 
relations 
ACE) = Soi. A(Es) S D1 a(C) +e = aC) + 


imply that Z isinner regular. Jf 
It is easy, but unnecessary, to prove the analogous result for 
outer regular sets; the following theorem is much more inclusive. 
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Theorem C. The union of a sequence of outer regular sets is 
outer regular; the union of an increasing sequence of inner 
regular sets is inner regular. 


Proof. If {E;,} is a sequence of outer regular sets, then, for 
every « > O and for every i = 1, 2, ---, there exists a set U; 
in 0 such that 


E;cU; ana p(U,) S$ A(E) + a 


we write U= (7, U.. If E = Uf, E; and a(Z) = ~, then 
E is trivially outer iar if A(E) < «, then 


a(U) — A(Z) = a(U — E) s aU (U, - £)) 
S D1 aU; — BE) = Li, @(U) — a(Z)) Ss 


If {E,} is an increasing sequence of inner regular sets and 
E = U2. E,, we make use of the relation 


A(E) = lim, A(Z)). 


Weare to prove that, for every real number ¢c withe < f(E), there 
is a set E in € such that CC E andc < a(C). To prove this, we 
need only select a value of 1 so that ¢ < A(E,), and then, using 
the inner regularity of E;, find a set C in C such that Cc E; 
andc < Aa(C). J 


Theorem D. The intersection of a sequence of inner regular 
sets of finite measure is inner regular; the intersection of a 
decreasing sequence of outer regular sets of finite measure is 
outer regular. 


Proof. If {E;} is a sequence of inner regular sets of finite 
measure, then, for every e > 0 and for every i = 1, 2, ---, there 
exists a set C; in C such that 


C:cE; and aE) $ aC) +5; 
we write C= ()2,C;. If E = ()2, £,, then ED Ce € and 


A(ZE) — a(C) = AE -C) S a(U21(E:-C)) Ss 
S Di-1 A(E; — C) = Dis (@(E) — a(C)) Se. 
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If {Z,} is a decreasing sequence of outer regular sets of finite 
measure and E = ()j., E;, we make use of the relation 


A(E) = lim, A(E,). 


We are to prove that, for every real number ¢ with ¢ > a(E), 
there is a set U in O such that EF C U andc > @(U). To prove 
this we need only select a value of ? so that c > A(E,) and then, 
using the outer regularity of E,, find a set U in © such that 
E;c Uand @(U)<c ff 

The duality between inner and outer regularity is even more 
thoroughgoing than is indicated by the similarities among the 
above proofs; we proceed to prove that the two kinds of regularity 
are essentially the same. 


Theorem E. 4 necessary and sufficient condition that every 
set in © be outer regular is that every bounded set in © be inner 
regular. 


Proof. Suppose that every set in C is outer regular, let U 
be a bounded set in 0, and let ¢ be a positive number. Let C 
be a set in € such that U c C; since C — U is compact and be- 
longs to §, it follows from 51.D that C — Ue €, and therefore 
that there exists a set V in 0 such that 


C-UCV and a(VY) ea(C—U)+e 
Since U = C — (C— U) DC—V 6, the relations 
AU) -a(C-V)=an0U-(C-P))=nUNY)S 

< av —(C-U)) =a) - aC - U) Ss 


imply that U is inner regular. 

Suppose next that every bounded set in 0 is inner regular, let 
C bea set in C, and let € be a positive number. Let U be a bounded 
set in 0 such that C c U; since U — C is a bounded set in 0, 
there exists a set D in € such that 


DcU-C and a@(U—C)S aD) +6 
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Since C = U—~ (U—C) CU— De, the relations 
ACU — D) — a(C) = a((U — D) — C) = a((U — C) — D?) 
= a(U—C)—Aa(D) Se 


imply that C is outer regular. J 


Theorem F. Either the outer regularity of every set in © or 
the inner regularity of every bounded set in 0 is a necessary and 
sufficient condition for the regularity of the measure f. 


Proof. The necessity of both conditions is trivial. To prove 
sufficiency, i it is enough (Theorem C) to prove that every bounded 
set in § is regular, since every set in § is the union of an increas- 
ing sequence of bounded sets in §. Let Eo be a bounded set in § 
and let Cy be a set in € such that Eo C Cy. By 5.E, the o-ring 
SN Cy is generated by the class of all sets of the form C N Cy, 
where CeC. By 51.F (applied to the compact space Co), this 
o-ring is generated by the ring of all sets of the form EN Co, 
where E is a finite, disjoint union of proper differences of sets of 
€. According as the condition on € or on 0 is assumed, it follows 
from Theorems A, B, and C that every set in this ring is outer 
or inner regular. Since by Theorems C and D, the class of outer 
regular subsets of Co and the class of inner regular subsets of Co 
are both monotone classes, it follows from 6.B and Theorem E 
that, assuming either of the two conditions, if a subset of Co is 
in §, then it is regular, and hence, in particular, that Ep is 
regular. ff 


TheoremG. Every Baire measure v is regular; if C © C, then 
v*(C) = inf {v(Up): C C Up & Uo}, 
and, if U eV, then 
vx(U) = sup {v(Cy): U D Cy e Co}. 


Proof. Since every set in Cy may be written as the intersection 
of a decreasing sequence of sets of finite measure in Up, the 
regularity of »v follows from Theorem F. Since, by definition of 
outer measure, 


v*(C) = inf {v(Eo): C C Ey & So} S inf {v(Up): C C Up & Uo}, 
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for every « > 0, there exists a set Eg in Sp such that C c Ey and 
v(Eo) S v*(C) + . The outer regularity of Ey implies the 
existence of a set Uy in Up such that 


Ey Uy and (Up) S »(Ey) + = 


it follows that C C Up and v(Up) S v*(C) + €. The proof of the 
assertion concerning inner measure exploits, in an entirely similar 
way, the inner regularity of every Baire set Ey. Jj 


Theorem H. Let u be a Borel measure and let v be the Baire 
contraction of u (defined for every Baire set E by v(E) = yu(E)). 
Either of the two conditions, 


u(C) = v*(C) for all C in C, 
u(U) = v4(U) for all bounded open U in U, 


is necessary and sufficient for the regularity of u. If two regular 
Borel measures agree on all Baire sets, then they agree on all 
Borel sets. 


Proof. If, for some C in C, u(C) = v*(C), then according to 
Theorem G, for every « > O there exists a set Up in Up such that 


CCU, and p(Uo) = »(Uo) S »*(C) + € = u(C) + ¢; 


this implies that C is outer regular and hence that yu is regular. 
The proof of the sufficiency of the condition involving v» exploits, 
in an entirely similar way, the last assertion of Theorem G. 

Suppose next that p is regular and let ¢ be an arbitrary positive 
number. For any C in C, there exists a bounded set U in U such 
that CC U and w(U) S w(C) + «; similarly, for any bounded 
set U in U, there exists a set Cin C such that CC U and p(U) S 
u(C) + «. In either case, there exist sets Cy in Cy and Us in Up 
such that CC Up CG) CU, (50.D). It follows from Theorem 
G that 


v*(C) S o(Uo) = w(Uo) S u(U) S v(C) + «, 
and 


va(U) 2 v(Co) = u(Co) 2 u(C) 2 u(U) — 
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The arbitrariness of ¢ implies that 


v*(C) Su(C) and v(U) 2 u(U); 


the reverse inequality is obvious in both cases. Since it has thus 
been shown that the values of a regular measure on the Baire 
sets uniquely determine its values on the compact sets, the last 
assertion of the theorem follows from 51.F. Jj 

We conclude this section by introducing a concept which some- 
times provides a useful tool for proving regularity. If « is any 
Borel measure, its Baire contraction po, defined for all E in Sp 
by uo(Z) = u(£), is a Baire measure associated with pu in a natural 
way. If it happens that every set in C or every bounded set in U, 
and therefore in either case, every set in S is uo*-measurable 
(i.e. if all compact sets, and therefore all Borel sets, belong to 
the domain of definition of the completion of yo), then we shall 
say that the Borel measure y» is completion regular. If u is com- 
pletion regular, then to every Borel set E there correspond two 
Baire sets 4 and B such that 


ACECB and p(B- 4 =); 


it follows from Theorem H that completion regularity implies 
regularity. 


(1) Every Borel measure is o-finite. 

(2) If the space X is compact, then the class of all regular sets is a normal 
class; (cf. 6.2). 

(3) If » is a Borel measure and if there exists a countable set Y such that 
u(E) = w(E 2 Y) for every Borel set E, then u is regular. 

(4) If X is the Euclidean plane and if » is Lebesgue measure on the class of all 
Borel sets, then u is a regular Borel measure in the sense of this section. If, 
however, for every Borel set EZ, u(E) is defined to be the sum of the linear 
measures of all horizontal sections of E, then » is not a Borel measure. 

(5) Suppose that X is compact and x* is a point such that {x*} is not a Gs; 
(cf., for instance, 50.3). If, for every E in S, n(Z) = xx(x*), then p is a regular 
Borel measure which is not completion regular. 

(6) If ys, we, and » are Borel measures such that p = yy + pe, then the regu- 
larity of any two of them implies that of the third. (Hint: if CeC, UeU, 
CC U, and w(V) S w(C) + «, then 


a(C) + ue(U) S u(U) S i(C) + u2(C) + €) 
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(7) Suppose that X and Y are compact Hausdorff spaces, T is a continuous 
transformation from X onto Y, and y is a Borel measure on X. If v = pT@}, 
and if Dis a compact subset of Y, then D is regular with respect to v if and only 
ifC = T—'(D) is regular with respect tou. (Hint:ifC C Ue, then T(X — U) 
and D are disjoint compact sets in Y. If V is a neighborhood of D which is 
disjoint from T(X — U), then CC TV) C UV.) 

(8) If » is a regular Borel measure, then, for every o—bounded set E, 


e*(E) = inf {u(U): EC UeU} and yus(E) = sup {u(C): ED CeC}. 


(9) If 4 and v are Borel measures such that p is regular and yp, then » 
is regular, 

(10a) Let Q be the first uncountable ordinal, and let X be the set of all ordinals 
less than or equal toQ. Write X = X — {Q}. If the class of all “intervals” 
of the form {x:a < x S B} together with the set {0} is taken for a base, then X 
is compact. 

(10b) The class of all unbounded, closed subsets of X is closed under the 
formation of countable intersections. 

(10c) If, for every Borel set E in X, u(E) = 1 or 0 according as E does or does 
not contain an unbounded, closed subset of X, then p» is a Borel measure. 

(10d) The Borel measure pw is not regular, (Hint: every interval containing 
Q has measure 1.) 


§ 53. GENERATION OF BOREL MEASURES 


The purpose of this section is to show how certain (regular) 
Borel measures may be obtained from more primitive set functions. 

We define a content as a non negative, finite, monotone, addi- 
tive, and subadditive set function on the class C of all compact 
sets. In other words, a content is a set function \ on C which is 
such that (a) 0 S X(C) < © for all C in C, (b) if C and D are 
compact sets for which C c D, then X(C) S XD), (c) if C and 
D are disjoint compact sets, then A(C U D) = X(C) + A(D), and 
(d) if Cand D are any two compact sets, then A(C U D) S X(C) + 
(D). We observe that, since \(0) + A(0) = AO U 0) = AO) < 
©, a content must always vanish on the empty set. 

The outline of our procedure from now on will be as follows. 
In terms of a given content d we shall define a set function A» 
on the class of open Borel sets, and in terms of \s we shall define 
an outer measure u* on the class of all o—bounded sets. Then 
we shall use the already established theory of u*-measurability 
to obtain from the outer measure u* a measure » which will, in 
fact, turn out to be a regular Borel measure. 


232 LOCALLY COMPACT SPACES [Sc. 53} 


The inner content x, induced by a content 4, is the set func- 
tion defined for every U in U by 
r+(U) = sup {X(C): UDC eC}. 


Theorem A. The inner content \» induced by a content 
vanishes at 0, and is monotone, countably subadditive, and 
countably additive. 


Proof. It is obvious that \4(0) = 0. If U and V are in U, 
if U c PV, and if C is a compact set contained in U, then CCV 
and therefore \(C) S »«(V). It follows that 


Ae(U) = sup MC) S r¥(V). 

If U and V are in U and if C is a compact set such that CC U 
U V, then (50.A) there exist compact sets D and E such that 
DcU, ECV, and C=DUE. Since X(C) S$ MD) + NE) 
S da (U) + 24 (VY), it follows that 

Aw(U UV) = sup MC) S Ae (U) + a (Y), 

ie. that dX» is subadditive. It follows immediately, by mathe- 
matical induction, that \» is finitely subadditive. If {U;} is a 
sequence of sets in U and if C is a compact set such that CC 


U2, U;, then, by the compactness of C, there is a positive integer 
n such that Cc LU? U;. It follows that 


MC) S rx(Uier Us) S Die ae (U) S DO (UD), 
and therefore that 
rx(Ui1 Us) = sup MC) S VR (UD), 


i.e. that Ax 1s countably subadditive. 

Suppose next that U and V are two disjoint sets in U and let 
C and D be compact sets such that CC U and Dc V.. Since 
C and D are disjoint and since CU DCU U FP, we have 


MC) + MD) = ACU D) (UU YF), 
and therefore 
Ae (U) + Ax(V¥) = sup A(C) + sup A(D) S «(UU VY). 
The subadditivity of \* implies now that Ax is additive and hence, 
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by mathematical induction, that Ax is finitely additive. If {U,} 
is a disjoint sequence of sets in U, then 


Ae (Ui1 Us) = e(UPe1 UD) = Se (Ud); 
since this is true for every 2 = 1, 2, ---, it follows that 
Ae(Ui-1 U3) = > Ae (Uj). 


The countable additivity of \» follows from its already proved 
countable subadditivity. 

If X is a content and dx is the inner content induced by \, 
we define a set function u* on the hereditary o-ring of all 
o—bounded sets by 


u*(E) = inf {\e(U): Ec Ue Uh. 


The set function u* is called the outer measure induced by }; 
the terminology is justified by the following result. 


Theorem B. The outer measure y* induced by a content » 
is an outer measure. 


Proof. The equation u*(0) = 0 follows from the facts that 
0cOecU and d4(0) = 0. If E and F are two o—bounded sets 
such that E Cc F, and if U is a set in U such that Fc U, then 
E c U and therefore u*(E) S d+(U). It follows that 


u*(E) S inf d+(U) = u*(F). 


If {Z;} is a sequence of o-bounded sets, then, for every « > 0 
and for every 7 = 1, 2, ---, there exists a set U; in U such that 


E;cU,; and X4(U;) S u*(E,) + = 
It follows that 
o*(Ui-1 E) S 4(U21 UD) Ss DPM (UD) S Dit (LE) + 6 


the arbitrariness of ¢ implies the countable subadditivity of u*. J 

It might be conjectured that the procedures of Theorems A and 
B actually yield extensions of \ and x respectively, i.e., for 
instance, that u* is such that u*(C) = d(C) for every compact 
set C. This is not true in general; the best that can be said is 
contained in the following result. 
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Theorem C. Jf dx is the inner content and p* is the outer 
measure induced by a content d, then p*(U) = d4(U) for every 
U in U and p*(C°) S XC) S u*(C) for every C in C. 

(We recall that C° denotes the interior of the set C.) 


Proof. If UeU, then the relation UC UeU implies thar 
u*(U) Sre(U). If MeU and UC FV, then (VU) S x (/) and 


therefore 
A«(U) S inf r¥(V) = w*(0). 


If CeC, UeU, and CCU, then X(C) S X«(U), and there- 
fore 
XC) S inf A¥(V) = w*(C). 
If CeC, DeC, and Dc C (cC), then A(D) S X(C), and 


therefore 
u*(C°) = Ax(C®) = sup (D) S XC). FF 


Theorem D. [f u* is the outer measure induced by a content 
d, then a o-bounded set E is u*—measurable if and only if 


u*(U) 2 w*(U NE) + (UN EB’) 
for every U in U. 


Proof. Let \» be the inner content induced by 3, let 4 be an 
arbitrary o—bounded set, and let U be a set in U such that 4 c U. 
From the relations 

Ae(U) = w*(U) = eX(UN FE) + w(UN BE) 2 
2uX(AN E)+u*(4n L’) 
it follows that 
u*(4) = infdx(U) 2 u*(4N EZ) + (AN £); 
the reverse inequality and the converse follow from the subaddi- 
tivity of u* and the definition of u*-measurability. Jj 


Theorem E. If u* is the outer measure induced by a con- 
tent d, then the set function p, defined for every Borel set E by 
M(E) = p*(Z£), is a regular Borel measure. 


We shall call « the Borel measure induced by the content X. 
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Proof. We shall prove first that every compact set C (and 
therefore every Borel set) is 4*-measurable; it will then follow 
immediately that w is a measure on the class of all Borel sets. 
In virtue of Theorem D it is sufficient to prove that 


u*(U) 2 w*(U NC) + w*(U NC’) 


for all U in U. Let D be a compact subset of UN C’ and let 
E be a compact subset of U N D’; we observe that both the sets 
UNC’ and UND?’ belong to U. Since DN E=0 and 
DU ECU, it follows that 


u*(U) = r*(U) 2 MD U E) = X(D) + AZ), 
where \x is, of course, the inner content induced by A. Therefore 
u*(U) = MD) + sup ME) = MD) + (UN D’) = 
= (D) + w*(U ND’) 2D) + w*(O NC); 
this in turn implies that 
u*(U) 2 u*(U NC) + supA(D) = w(U NC) + Ae(U NC’) = 
=nw(UNC)+e*(UNC). 


To prove that u(C) < «, we observe that there exists a com- 
pact set F such that C c F*; it follows that 


u(C) = w*(C) S u*(F*) S MF) <<. 
The fact that the measure p is regular follows, finally, from the 
relations 
u(C) = w*(C) = inf {dy(U): C CU eV} = 
= inf {u*(U):C CU eU} = inf {u(U):C CUeV}. J 
We conclude with a result which we shall have opportunity to 
use later. 


Theorem F. Suppose that T is a homeomorphism of X 
onto itself and that \is acontent. If, for every C in C, \(C) = 
x(T(C)), and if w and fi are the Borel measures induced by 
and ) respectively, then Q(E) = p(T(E)) for every Borel set E. 
If, in particular, \ is invariant under T, then the same is true 


Of ut. 
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Proof. If \ and \, are the inner contents induced by \ and 
\ respectively, and if U e U, then the relations 


{A(C): UD CeC} = {A(T(C)): UDC eC} = 

{\(D): D = T(C), UDC eC} = 
{A(D): UD T-(D) eC} = 
{A(D): T(U) > D eC} 


imply that \4(U) = »A*(T(U)). If * and f* are the outer meas- 
ures induced by \ and 4 respectively, then a similar computation 
shows that, for every o—bounded set EF, *(E) = u*(T(E)), and 
hence that, for every Borel set E, A(E) = w(T(E)). The last 
assertion of the theorem is an immediate consequence of the 
preceding ones. J 


(1) The following are examples of non negative, finite set functions defined 
on the class C of all compact subsets of a locally compact Hausdorff space; some 
of them are contents, while others fail to possess exactly one of the principal 
defining properties (monotoneness, additivity, and subadditivity) of a content. 

(1a) X* is the one-point compactification of an infinite discrete space X; 
for every compact set C in X*, A(C) = 0 or 1 according as C is finite or infinite. 

(1b) X is a discrete space consisting of a finite number of points; A(C) = 1 
for every compact set C. 

ye X is the closed interval [—1, +1]; A(C) = 1 or 0 according as Oe C® or 
Oe’ 0. 

(Id) X* = {X,x*} is, as in (1a), the one-point compactification of an infinite 
discrete space X;A(C) = 1 or 0 according as x* © C or x* ©’ C. 


(le) X = {o, ao n= 1, 2, ae If C contains infinitely many nega- 


tive numbers, then A(C) = 0; otherwise A(C) = 1 or O according as 0eC or 
Oe'C. 
(1f) Let uo be a Baire measure on X, and, for every C in C, write 


A(C) = sup {uo(Co): cD Coe Co}. 


(1g) Let » be a Borel measure on X, and, for every Cin C, writeA(C) = n(C®). 

(2) IfX and X are two contents inducing the outer measures p* and fi* respec- 
tively, and if, for every C in C, d(C) = XC) S w*(C), then w* = p*. (Hint: 
in view of the first part of Theorem C, it is sufficient to prove that p*(U) = 
sup {\(C): UD Ce C} for every U in U.) 

(3) The result of (2) may be strengthened to the following converse of Theorem 
C. IfX and X are two contents, inducing the outer measures y* and ft* respec- 
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tively, and if, for every C in C, n*(C°) S X(C) S$ u*(C), then p* = A*. (Hint: 
Theorem E implies that 


u*(U) = sup {u*(C): UD CeC} 
for every Uin U. It is to be proved that 
u*(U) = sup {A(C): UD CeC}. 


Ife > Oand Ue, then there exists a set Cin C such that C C Uandu*(U) S$ 
u*(C) + «, and there exists a set D in C such that CC D.C DC U,) 

(4) Ifp is the Borel measure induced by a content A, and ifA(C) > O whenever 
C® 0, then n(U) > 0 for every non empty U in U. 

(5) Independently of any content \ we might consider those outer measures 
#* on the class of all o—bounded sets which have the property that 


u*(C) = inf {u*(U): CC UeU} <0 


for every Cin C. Are Theorems D and E true for any such outer measure? 


§ 54. REGULAR CONTENTS 


We have remarked before on the fact that the values of a con. 
tent need not coincide (on compact sets, of course) with the values 
of the Borel measure it induces. There is, however, an important 
class of contents which are such that the process of § 53 is actually 
an extension. In this section we shall study such contents and 
use our results to derive an important extension theorem which 
asserts, in fact, the existence of certain Borel measures whose 
uniqueness was established in 52.H. 

A content } is regular if, for every C in C, 


XC) = inf {A(D): CC D° Cc De Ch. 


This definition of regularity for contents imitates the definition 
of (outer) regularity for measures as closely as possible in view 
of the restricted domain of definition of a content. 


Theorem A. If uy is the Borel measure induced by a regular 
content , then p(C) = d(C) for every C in C. 


Proof. If C eC, then, because of the regularity of 4, for every 
¢ > 0 there exists a set D in C such that 


CcD® and XD) S$ XC) +. 
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It follows from 53.C that 
XC) S w(C) S w(D°) SAD) SAC) +; 
the desired result follows from the arbitrariness of «. J 
The following result goes in the converse direction. 


Theorem B. Jf u is a regular Borel measure and if, for every 
C in C, MC) = w(C), then d is a regular content and the Borel 
measure induced by \ coincides with p. 


Proof. It is clear that \ is a content. The regularity of pu 
implies that, for every C in C and for every € > 0, there exists a 
set U in U such that 


CcU and wV) Su(C) +. 
If D is a set in C such that Cc D° CD c U, then 
(D) = w(D) S w(V) S u(C) +e = MC) + 6; 


this proves the regularity of \. If f@ is the Borel measure induced 
by \, then, by Theorem A, 2(C) = X(C) = u(C) for every C in C, 
and therefore, indeed, fA =p. § 


Theorem C. Jf uo ts a Baire measure and if, for every C in 
C, 
x(C) = inf { uo(Uo): Cc Uo e Up}, 


then ) is a regular content. 


Proof. It is easy to verify that \ is non negative, finite, and 
monotone. 

If C and D are sets in C and Up and Vp are sets in Up such that 
CcU,and Dc PM, thenC UDC Uy U VY € Up, and therefore 


MC U D) S wo(Uo U Vo) S wo(Uo) + uo(Y%o). 
It follows that 
XC U D) S inf po(Uo) + inf wo(4%o) = MC) + A(D), 


i.e. that X is subadditive. 
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If C and D are disjoint sets in C, then there exist disjoint sets 
Uo and Vo in Up such thatCc Uo andDc Vo. Ifc UD cH, 
e Up, then 


XC) + MD) S po(Uo N Wo) + wo(¥o N Wo) S woo), 
and therefore 
xX(C) + A(D) S inf po(Wo) = XC U D). 


The fact that \ is additive follows from the fact, proved above, 
that d is subadditive. 

To prove that ) is regular, let C be any compact set and let ¢ 
be any positive number. By the definition of d, there exists a 
set Uy in Up such that 


CcUp and pw(U) SAC) +e. 
If D is a compact set such that CC D° C DC Us, then 
MD) S uo(Uo) S$ MC) +e | 


Theorem D. Jf uo is a Baire measure, then there exists a 
unique, regular Borel measure wy such that p(E) = po(E) for 
every Batre set E. 


Proof. If, for every C in C, 
MC) = inf {uo(Uo): C C Up € Uo}, 


then, by Theorem C, \ is a regular content; let » be the regular 
Borel measure induced by 4. By Theorem A, u(C) = X(C) for 
every C in C. Since (52.G) every Baire measure is regular, we 
have A(C) = uo(C), and consequently u(C) = uo(C) for every C 
in Co. This proves the existence of »; uniqueness was explicitly 
stated and proved in 52.H. J 


(1) Which of the set functions described in 53.1 are regular? 

(2) If, in the notation of 53.F, \ is a regular content, then so is X. 

(3) If » is a Borel measure and if, for every C in C, A(C) = sup {u(Co): 
CD Coe Co}, then pu is completion regular if and only if X is a regular content; 
(ef. 53.1f). 

(4) A content \ is inner regular if, for every C in C, X(C) = sup {A(D): 
C°D DeC}. The following analogs of Theorems A and B are true. 
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(4a) If u is the Borel measure induced by an inner regular content A, then 
u(C°) = X(C) for every C in C. 

(4b) If w is a regular Borel measure and if, for every C in C, A(C) = u(C), 
then ) is an inner regular content and the Borel measure induced by \ coincides 
with p. 


§ 55. CLASSES OF CONTINUOUS FUNCTIONS 


If X is, as usual, a locally compact Hausdorff space, we shall 
denote by £(X) or simply by & the class of all those real valued, 
continuous functions on X which vanish outside a compact set. 
In other words & is the class of all those continuous functions / 
on X for which the set 


Nf) = {x: f(x) ¥ 0} 


is bounded. If X is not compact and if X* is the one-point com- 
pactification of X by x*, then the point x* is frequently called 
the point at infinity, and consequently £ may be described as the 
class of all those continuous functions which vanish in a neighbor- 
hood of infinity. We shall denote by £,(X) or simply by £4 
the subclass of all non negative functions in &. The first of our 
results concerning these function spaces has been implicit in many 
of our preceding constructions. 


Theorem A. If C is any compact Baire set, then there exists 
a decreasing sequence {fn} of functions in &4 such that 


lima fa(x) = xe(x) 
for every x in X. 


Proof. If C = ()r-; U,;, where each U,, is a bounded open 
set, then for each positive integer 7 there exists a function gn 
in & (cf. § 50) such that 
1 if «eC, 


En(x) = f ‘foe. 


If f, = g1 N---N ga, then {f,} is a decreasing sequence of non 
negative continuous functions such that 


lima fn(x) = xe(x) 
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for every x in X; the fact that U,, is bounded implies that /, € £4, 
n=1,2,---. J 

If yo is a Baire measure in X, if f © &, and if {x: f(x) 0} c 
C eCo, then the facts that uo(C) < © and that f is bounded and 
Baire measurable (51.B) imply that f is integrable with respect 


to vo and that 
Jidu0 = f fa 


These statements are true, in particular, if u is a Borel measure 
and po is its Baire contraction. 


Theorem B. Jf a Baire measure uw is such that the measure 
of every non empty Baire open set is positive, and if fe £4, 


then a necessary and sufficient condition that f Sdu = 0 is that 

S(*) = 0 for every x in X. 

Proof. The sufficiency of the condition is trivial. To prove 
necessity, suppose that f fdp = 0 and let U be a bounded open 
Baire set such that {x: f(x) #0} CU. If E = {x: f(x) = 0}, 


then, since 
a > 


it follows frorn the fact that f is non negative that n(U — E) = 0. 
Since U — E is an open Baire set, we must have U — E = 0, or 


UcE. J 


Theorem C. Jf yo is a Baire measure and «> 0, then, 
corresponding to every integrable simple Baire function f, there 
exists an integrable simple function g, 


f= Dnt XC 


such that C;is a compact Batre set,i = 1, ---, n, and 


fis g ldo S « 


Proof. Write f = > ?.; aixz, and let ¢ be a positive number 
such that | f(x) | S ¢ for every x in X (i.e. such that | a;| S ¢ 
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for i = 1, ---, ). The regularity of uo implies that, for each 
i = 1, ---,, there exists a compact Baire set C; such that 


Cio By and wo(E) S mo(C) + = 
It follows that if g = D\?.1 axe, then 
fle = gléwo = Dial es lus C) Se 0 


Theorem D. Jf yo is @ Baire measure, if «> 0, and if 
g = Drei axe, is a simple function such that C; is a compact 
Batre set,i = 1, +--+, n, then there exists a function hin & such 
that 


fis ~ Aldwo < « 


Proof. Since {C;, --+, C,} is a finite, disjoint class of compact 
sets, there exists a finite, disjoint class {U,, ---, U,} of bounded 
open Baire sets such that C; C U;,i = 1, ---,. Because of the 
regularity of yo, there is no loss of generality in assuming that 


wo(U;) S po(C;) ps; i=1,---,n, 
ne 


where ¢ is a positive number such that | g(x) | Sc for every x 
in X. For each i = 1, ---, m, there exists a function 4; in § 
such that 4;(x) = 1 for « in C; and A,(~) = 0 for x in X — U;; 
we write A= Doin, aA; Since hye £4, i =1, +++, n, it is clear 
that 4 © £; the disjointness of the U; implies that | A(x) | S ¢ for 
all xin X. We have 


fis —h|dyo = Ditar et h|duo S Dheie-m(U;-C) Se | 


(1) If u is a regular Borel measure, then the class of all finite linear combina- 
tions of characteristic functions of compact sets is dense in £,(u), 1S p<». 

(2) If w is a regular Borel measure, then £& is dense in £,(u), 1 S p < @. 

(3) If w is a regular Borel measure, E is a Borel set of finite measure, and f 
is a Borel measurable function on £, then, for every € > Q. there exists a compact 
set C in E such that p(EZ — C) S € and such that fis continuous on C. (Hint: 
if f is a simple function, the result may be proved by the technique used in the 
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proof of Theorem C. In the general case, there exists a sequence { f,} of simple 
functions converging to f; by Egoroft’s theorem and the regularity of yu, there 


exists a compact set Cy in E such that u(EZ) S u(Co) + ; and such that {f,} 
converges to f uniformly on Co. Let Cy, be a compact subset of E such that 
u(E) S$ ul(C,) + a and such that f, is continuous on C,; the set 

C= Ns-o Cn 


satisfies the required conditions.) This result is known as Lusin’s theorem. 


§ 56. LINEAR FUNCTIONALS 


A linear functional on & is a real valued function A of the func- 
tions in £& such that 


A(af + Bg) = aA(f) + BA(g) 


for every pair, f and g, of functions in £ and every pair, a and 8, 
of real numbers. A linear functional A on & is positive if A(f) = 0 
for every f in £4. We observe that a positive linear functional A 
is monotone in the sense that if fe &, ge £, and f 2 g, then 
A(f) 2 A(g). It is easy to verify that if uo is a Baire measure in 


X and if A(f) = f Jdvo for every f in &, then A is a positive linear 


functional; the main purpose of this section is to show that every 
positive linear functional may be obtained in this way. 

We shall find it convenient to employ a somewhat unusual but 
very suggestive notation. If EF is any subset of X and / is any 
real valued function on X, then we shall write Z cf [or E > /] 
if x(x) S f(«) [or xz(*) = /(«)] for every x in X. 


Theorem A. Jf A is a positive linear functional on & and if, 
for every C in C, 


MC) = inf {A(/): C Cfe 24}, 
then » is a regular content. If wis the Borel measure induced 
by d, then 
u(U) S A(f) 


for every bounded open set U and for every f in &4 for which 
Ucf. 
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Proof. The fact that A is positive implies that \(C) 2 0 for 
every Cin C. To prove that d is finite, let C be any compact 
set and let U be any bounded open set containing C. Since there 
exists a function fin £4 such that f(x) = 1 for xin Cand f(x) = 0 
for x in X — U, it follows that C Cf e &4 and therefore 


x(C) S A(f) < &. 


If C and D are compact sets, C > D, and if CC fe &4, then 
Dc/f, and, therefore, \(D) S A(/). It follows that \(D) S 
inf A(f) = X(C), i.e. that \ is monotone. 

If Cand D are compact sets, andifC Cfe&,andDCge&i, 
then 

CUDCSf+ge&, 


and therefore \(C U D) S A(f +g) = A(/) + A(g). It follows 
that 


x(C U D) S inf ACf) + inf A(g) = x(C) + X(D), 


i.e. that \ is subadditive. 

If C and D are disjoint compact sets, then there exist disjoint 
bounded open sets U and V such that CC Uand DC V. Let 
J and g be functions in £4 such that f(x) = 1 for «in C,f(~) = 0 
for xin X — U, g(x) = 1 for xin D, and g(x) = 0 for xin X — V. 
IfCUDcChe &,, then 


X(C) + A(D) S ACAf) + ACAg) = ACA(S + g)) S ACA). 
It follows that 
d(C) + (D) S inf A(A) = (CU D); 


the additivity of \ now follows from its subadditivity. 

We have thus proved that \ is a content; it remains to prove 
that \ is regular. For every C in C and for every e > 0, there 
exists a function f in &, such that 


Ccf and A(f) $C) +5 
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If y is a real number, 0 < y < 1, andif D = {x: f(x) 2 y}, then 


Cc {x:f(x) 21} C {x:f(x) > y} CD° Cc De. 
1 
Since D c —f & £4, 1t follows that 
Y 
1 1 € 
MD) 5 -A(/) $ — (NC) + 5). 
¥ Y 2 
Since y may be chosen so that 
~(n(C) + S) = x(C) + 
~(xc) +5) SMO) + 6 


it follows that (D) S X(C) + e; the arbitrariness of ¢ implies 
that d is regular. 

The last assertion of the theorem is an easy consequence of 
the regularity of u. Indeed, if C is a compact set contained in U, 
then C cf and therefore 


w(C) = AC) SAY); 
it follows that u(U) = sup u(C) SACS). Ff 


Theorem B. Jf A is a positive linear functional on &, if, 
for every Cin C, 


XC) = inf {A(/): Ccfe £4}, 


and if » 1s the Borel measure induced by the content d, then 


fins a) 
for every f in £4. 
Proof. Since both f fdu and A(f) depend linearly on f, it is 


sufficient to prove the inequality for functions f such that 
0 < f(x) S 1 for all x in X. 
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Let ” be a fixed positive integer and write, for i = 1, ---, n, 


0 if f(x) < —, 
ae ; i-1 i 
fs) = jo = fle) —- G- 1) if — sf) s-, 
- n n 
n . 
1 if ~ < fix). 
n 


Since, for 7 = 1, ---, 7, 

fi = (Inf — @ — 12] U0) NL = (af — @ — 1) N 1) UO, 
the functions “ all belong to £4. Since for any x for which 
a Ss f(x) st ~ , we have 


1 if lsisj-1, 
Ne to if jtlsisn, 
1 
it follows that f(x) = — 35%... f,(~) for every x in X. 
n 


If, for 2 = 0, 1, ---, my U; = {x= £0) >I, then U; is a 
n 


bounded open set such that, fori = 1, ---,”, U; C fi, and hence, 
by Theorem A, u(U;) S$ A(fi). Since Up D Ui D---D Uy, = 0, 
we have 


1 1 
AY) = Fs Din Af) 2 3 De w(U;) = 


efi 
- ma -—) wu) = 
eS rele -[u(Us) — w(Ui41)] = 


+1 1 
= aye 2 Ws - Ui41) ba 7 BO) 2 
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1 
n—1 a = 
a ee Sif 7 MO) 


1 1 
7 f fay ~ = w(Us) fren - — u(Uo). 


The arbitrariness of 7 and the finiteness of u(Uo) imply the desired 
result. fj 


Theorem C. Jf A is a positive linear functional on &, if, 
for every C in C, 


XC) = inf {A(f): C Cfe £4}, 


and if pis the Borel measure induced by the content , then, 
corresponding to every compact set C and every positive number 
e, there exists a function fo in &4 such thatC Cfo, fo S 1, and 


A(fo) $ [fodu + « 
Proof. Jet go be a function in £4 such that 
CCgo and A(go) S$ MC) + 


If fo = zo 1 1, then it follows that 
A(fo) S Algo) S w(C) +e S ffodute A 


Theorem D. Jf A is @ positive linear functional on &, then 
there exists a Borel measure p such that, for every f in &, 


ACA) = [fds 


Proof. Write \(C) = inf {A(/):C Cf e £4} for every Cin C, 
let » be the Borel measure induced by the content 2, and let f 
be any fixed function in &£. 


Let C be a compact set such that {x: f(x) ~ 0} CC, and let e 
be a positive number. According to Theorem C, there exists a 


function fy in £4. such that C C fo, fo $1, andA(fo) S f. Sodu + «. 
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We observe that since C C fy, it follows that fo=/f. If c 
is a positive number such that | f(~) | S ¢ for all x in X, then the 
function (f + c)fo belongs to £4 and hence, by Theorem B, 


A(f) + cA(fo) = Mf + fo) = f + fedu = 
= fife +e fodu. 


It follows that 
A) & fidu + | Pfodu — nfo] & [ida ~ 06 


the arbitrariness of ¢ implies that A(f) 2 f ‘fu, i.e. that Theorem 


B is true for all fin &. Applying this inequality to —f yields its 
own reverse. J 


Theorem E. Jf u is a regular Borel measure, if, for every 
f in & ACS) = [fu and if, for every C in C, 


XC) = inf {A(/): CCfe ss}, 


then p(C) = XC) for every C in C. Hence, in particular, the 
representation of a positive linear functional as an integral with 
respect to a regular Borel measure is unique. 


Proof. It is clear that w(C) S$ X(C). If CeC and e > 0, 
then, by the regularity of », there exists a bounded open set U 
containing C such that u(U) S uw(C) + e«. Let f be a function 
in § such that f(x) = 1 for x in C and f(x) = O for x in X — U; 
then C Cf e £4 and 


XC) SAV) = ffdu 5 uU) S uC) + 
The arbitrariness of ¢ implies the desired result. J 


(1) If xo is a point of X and A(/) = f(xo) for every fin &, and if u(E) = xz(xo) 
for every Borel set E, then A(/) = {sau 


(2) If yo is a Baire measure and A(/) = {fda for every f in &, and if uz isa 


(Src. 56] LOCALLY COMPACT SPACES 249 


Borel measure such that A(/) = Siew, then w(Z) = po(£) for every Baire set E. 
(3) If uo is a Baire measure and A(/) = {fan for every f in &, write 


(UV) = sup {A(/): UD fe £4} 
for every U in U, and 
u*(E) = inf {\y(U): EC UeU} 


for every o—bounded set E; then u*(E) = po(Z) for every Baire set E. 

(4) Let X be the one-point compactification, by «, of the countable discrete 
space of positive integers. A function/ in £ is, in this case, a convergent sequence 
{f(n)} of real numbers with f(0) = lim, f(a); the most general positive linear 
functional A is defined by 


Af) = Disnsef(m)An, 


where 2, An is a convergent series of positive numbers. 

(5) A linear functional A on £ is bounded if there exists a constant & such that 
| AC) | S &sup {| f(x) |: «© X} for every fin &. Every bounded (but not neces- 
sarily positive) linear functional is the difference of two bounded positive linear 
functionals. The proof of this assertion is not trivial; it may be achieved by 
imitating the derivation of the Jordan decomposition of a signed measure. 

(6) If X is compact, then every positive linear functional on £& is bounded. 


Chapter XI 


HAAR MEASURE 


§ 57. FULL SUBGROUPS 


Before beginning our investigation of measure theory in 
topological groups, we shall devote this brief section to the proof 
of two topological results which have important measure theoretic 
applications. The results concern full subgroups; a subgroup Z 
of a topological group X is full if it has a non empty interior. 
We shall show that a full subgroup Z of a topological group X 
embraces the entire topological character of X—everything in X 
that goes beyond Z is described by the left coset structure of Z 
which is topologically discrete. We shall show also that a locally 
compact topological group always has sufficiently small full sub- 
groups—i.e. full subgroups in which none of the measure theoretic 
pathology of the infinite can occur. 


Theorem A. Jf Z is a full subgroup of a topological group 
X, then every union of left cosets of Z is both open and closed 
in X. 


Proof. Since the complement of any union of left cosets is 
itself such a union and since a set whose complement is open 
is closed, it is sufficient to prove that every such union is open. 
Since a union of open sets is open, it is sufficient to prove that 
each left coset of Z is open, and for this, in turn, it is sufficient to 
prove that Z is open. 

Since Z° # 0, there is an element % in Z°. If zis any element 


of Z, then z%~! « Z and therefore 2%—!Z = Z. It follows that 
250 
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zzgtZ° = Z° and hence that 
Z = (2% )zy & Z. 


Since z is an arbitrary element of Z, we have thereby proved that 
Zc 2°, i.e. that Zis open. §f 


Theorem B. Jf E is any Borel set in a locally compact 
topological group X, then there exists a o-compact full subgroup 
Z of X such thatE CZ. 


Proof. It is sufficient to prove (cf. 51.A) that if {C,} is a 
sequence of compact sets in X, then there exists a a full 
subgroup Z of X such that C, C Z for n = 1, 2, 

Let D be a compact set which contains a neenboiieed of e. 
We write Dy = D and, for n = 0, 1, 2, 


Drs = Dn™Da U Ca41- 


If Z = Us-o D,, then Z is o—compact, has a non empty interior, 
and contains each C,; we shall complete the proof by showing 
that ZZ c Z. 

We show first that if, for any » = 0, 1, 2, ---, ee Da, then 
Dn © Dry. Indeed, if ¢ e D,, then ee D,~; it follows that 
if x e D,, then 


“Xe (D,a)« Cc Dna™Da Cc Da+t- 
Since e ¢ Dp, it follows by mathematical induction that D, C Dn41 
for n = 0, 1, 2, - 
If x and y are any two elements of Z, then, because of the result 


of the preceding paragraph, both x and y belong to D,, for some 
positive integer 7, and therefore 


xy e Da Da C Dairy CZ. | 


§ 58. EXISTENCE 


A Haar measure is a Borel measure yu in a locally compact 
topological group X, such that p(U) > 0 for every non empty 
Borel open set U, and u(xE) = p(E) for every Borel set E. The 
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purpose of this section is to prove, for every locally compact 
topological group, the existence of at least one Haar measure. 

The second defining property of a Haar measure may be called 
left invariance (or invariance under left translations); we observe 
that the first property is equivalent to the assertion that yu is not 
identically zero. Indeed, if u(U) = 0 for some non empty Borel 
open set U, and if Cis any compact set, then the class {xU: x e C} 
is an open covering of C. Since C is compact, there exists a finite 
subset {x;, ---, xn} of C such that CC UL, «,U, and the left 
invariance of uw implies that u(C) S >i. u(xiU) = nu(U) = 0. 
Since the vanishing of u on the class C of all compact sets implies 
its vanishing on the class S of all Borel sets, we obtain the desired 
result: a Haar measure is a left invariant Borel measure which is 
not identically zero. 

Before exhibiting the construction of Haar measures, we remark 
on the asymmetry of their definition. Left translations and right 
translations play a perfectly symmetrical role in groups; there is 
something unfair about our emphasis on left invariance. The 
concept we defined should really be called “left Haar measure”; 
an analogous definition of “right Haar measure” should accom- 
pany it, and the relations between the two should be thoroughly 
investigated. Indeed in the sequel we shall occasionally make use 
of this modified (and thereby more precise) terminology. In most 
contexts, however, and specifically in connection with the problem 
of the existence of Haar measures, the perfect left-right symmetry 
justifies an asymmetric treatment; since the mapping which 
sends each x in X into x! interchanges left and right and preserves 
all other topological and group theoretic properties, every “left 
theorem” automatically implies and is implied by its correspond- 
ing “right theorem.” It is, in particular, easy to verify that if » 
is a left Haar measure, and if the set function » is defined, for every 
Borel set E, by »(Z) = p(E~), then » is a right Haar measure, 
and conversely. 

If E is any bounded set and F is any set with a non empty 
interior, we define the “ratio” E:F as the least non negative 
integer ” with the property that E may be covered by 7 left 
translations of F, i.e. that there exists a set {x1, -+-, xn} of 7 
elements in X such that EC Uj.ixF. It is easy to verify 
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that (since E is bounded and F° is not empty) E:F is always 
finite, and that, if 7 has the properties of both E and F, i.e. if 4 
is a bounded set with a non empty interior, then 


E:F s (E:4)(4:F). 


Our construction of Haar measure is motivated by the follow- 
ing considerations. In order to construct a Borel measure in a 
locally compact Hausdorff space it is sufficient, in view of the 
results of the preceding chapter, to construct a content \, i.e. 
a set function with certain additivity properties on C. If Cis a 
compact set and U is a non empty open set, then the ratio C:U 
serves as a comparison between the sizes of C and U. If we form 
the limit, in a certain sense, of the product of this ratio by a 
suitable factor depending on the size of U, as U becomes smaller 
and smaller, the resulting number should serve as the value of 
rat C. 

The outline in the preceding paragraph is not quite accurate. 
In order to illustrate the inaccuracy and make our procedure 
more intuitive, we mention an example. Suppose that X is the 
Euclidean plane, » is Lebesgue measure, and C is an arbitrary 
compact set. If U, is the interior of a circle of radius r, and 
if we write, for every r>0, n(r) =C:U,, then, clearly, 
n(r)ar? = w(C). It is known that lim,_,9 2(r)a7? exists and is 


equal not to p(C) but to u(C); in other words, starting 


with the usual notion of measure, which assigns the value rr? to 
U,, our procedure yields a different measure which is a constant 
multiple of the original one. For this reason, in an attempt to 
eliminate such a factor of proportionality, we shall replace the 
ratio C:U by the ratio of two ratios, i.e. by (C: U)/(4:U), where 
A is a fixed compact set with a non empty interior. 


Theorem A. For each fixed, non empty open set U and 
compact set A with a non empty interior, the set function dv, 
defined for all compact sets C by 
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is non negative, finite, monotone, subadditive, and left invariant; 
it is additive in the restricted sense that if C and D are compact 
sets for which CU“! N DU™ = 0, then 


Au(C U D) = Av(C) + Av(D). 


Proof. The verification of all parts of this theorem, except 
possibly the last, consists of a straightforward examination of the 
definition of ratios such as C:U. To prove the last assertion, let 
xU be a left translation of U and observe that if CN «U #0, 
then x e CU and if DN «U #0, then xe DU. It follows 
that no left translation of U can have a non empty intersection 
with both C and D and hence that dy has the stated additivity 


property. J 


Theorem B. Jn every locally compact topological group X 
there exists at least one regular Haar measure. 


Proof. In view of 53.E and 53.F it is sufficient to construct 
a left invariant content which is not identically zero; 53.C implies 
that the induced measure is not identically zero and hence is a 
regular Haar measure. 

Let 4 be a fixed compact set with a non empty interior and let 
N be the class of all neighborhoods of the identity. For each 
U in N, we construct the set function dy, defined for all compact 
sets C by dAy(C) = — 7 since C:U S (C:4)(A4:U), it follows 
that 0 S AYp(C) S C:4 for every Cin C. Theorem A shows that 
each dy is almost a content; it fails to be a content only because 
it is not necessarily additive: We shall make use of the modern 
form of Cantor’s diagonal process, i.e. of Tychonoff’s theorem on 
the compactness of product spaces, to pick out a limit of the Ay’s 
which has all their properties and is in addition additive. 

If to each set C in C we make correspond the closed interval 
[0,C: 4], and if we denote by ® the Cartesian product (in the 
topological sense) of all these intervals, then & is a compact 
Hausdorff space whose points are real valued functions ¢ defined 
on C, such that, for each C in C, 0 < 6(C) S$ C:A. For each 
U in N the function Xx 1s a point in this space. 
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For each U in N we denote by A(U) the set of all those func- 
tions Ay for which V Cc U; i.e. 


A(U) = (yy: U DV eN}. 


if {U;, «++, Up} is any finite class of neighborhoods of the identity, 
i.e. any finite subclass of N, then ()}., U; is also a neighborhood 
of the identity and, moreover, 


Nir 0; CU;, f=), +++, 0. 


It follows that 
A((Nin1 Us) C Ii AUD), 


and hence, since A(U) always contains dy and is therefore non 
empty, that the class of all sets of the form A(U), U e N, has the 
finite intersection property. The compactness of ® implies that 
there is a point \ in the intersection of the closures of all A(U); 


Nef) {A(U): Ue N}. 


We shall prove that \ is the desired content. 

It is clear that 0 S X(C) S$ C:4 < & for every Cin C. To 
prove that \ is monotone, we remark that if, for each fixed C 
in C, &c(¢) = $(C), then & is a continuous function on &, and 
hence, for any two compact sets C and D, the set 


A = [¢:4(C) S ¢(D)} c@ 


is closed. If CCD and UeN, then dy €A and consequently 
A(U) CA. The fact that A is closed implies that Xe A(U) CA, 
i.e. that \ is monotone. 

The proof of the subadditivity of X is entirely similar to the 
above continuity argument; we omit it and turn instead to the 
proof that \ is additive. If C and D are compact sets such that 
CN D =0, then there exists a neighborhood U of e such that 
CU1n DUt=0. If Ye N and VY cU, then CV N DV 
= 0 and hence (Theorem A) 


Av(C U D) = dAy(C) + Av(D). 


This means that, whenever V c U, dy belongs to the closed set 
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= {¢: o(C U D) = o(C) + 9(D)}, and hence that A(U) CA. 


i follows that \ © A(U) cA, i.e. that d is additive. 

Another application of the continuity argument shows that 
(4) = 1 (since Ay(4) = 1 for every U in N), and hence that the 
set function \ (which is already known to be a content) is not 
identically zero. The fact that is left invariant follows, again 
by continuity, from the left invariance of each Ay. 


(1) The existence cf a right Haar measure follows from the existence of a left 
Haar measure by consideration of the group X dual to X. The group X has, 
by definition, the same elements and the same topology as X; the product 
(in X) of two elements x and y (in that order) is, however, dehried to be the 
product (in X) of y and x (in that order). 

(2) Haar measure is obviously not unique, since, for any Haar measure p 
and any positive number ¢, the product cu is also a Haar measure. 

(3) If, for every U in N, Av is the set function eae in Theorem A, then, 


for every compact set C with C° #0, 0<——~SAad(C). It follows tnat 


X(C) > O whenever C® ¥ 0. 
(4) The following is a well known example of a group in which the left and 
right Haar measures are essentially different. Let X be the set of all matrices 


ria 


of the form ( +) , where 0 < x < 0 and —% <_y < +00; it is easy to verify 


that, with respect to ordinary matrix multiplication, X is a group. "If X is 
topologized in the obvious way as a subset (half plane) of the Euclidean plane, 
then X becomes a locally compact topological group. If we write, for every 
Borel set E in X, 


ji) sa f f, 55 dedy and »(E) = fi f= ady 


(where the integrals are with respect to Lebesgue measure in the half plane), 
then uw and v are, respe-tively, left and right Haar measures in X. Since u(E7') 
= v(£), this example shows also that there may exist measurable sets E for which 
u(E) <0 and p(E-!) = &, 

(5) If C and D are two compact sets such that u(C) = u(D) = 0, does it 
follow that u(CD) = 0? 

(6) If uw is a Haar measure in X, then a necessary and sufficient condition 
that X be discrete is that u({x}) x 0 for at least one x in X. 

(7) Every locally compact topological group with Haar measure satisfies the 
condition of 31.10; (cf. § 57). 

(8) If a Haar measure p in X is finite, then X is compact. 

(9) If wis a Haar measure in X, then the following four assertions are mutually 
equivalent: (a) X is o~com pact; (b) u is totally o—finite; (c) every disjoint class 
of non empty open Borel sets is countable; and (d) for every non empty open 
Borel set U, there exists a sequence {xn} of elements in X such that 


xX = Us. 1 xnU, 
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§ 59. MEASURABLE GROUPS 


A topological group is, by definition, a group X with a topology 
satisfying a suitable separation axiom and such that the trans- 
formation (from X X X onto X) which takes (x,y) into x7ly 
is continuous. For our present purposes it is convenient to replace 
this definition by an equivalent one which requires that the 
transformation S (from X X X onto itself), defined by S(x,y) = 
(x,xy) be a homeomorphism. If, indeed, X is a topological group 
in the usual sense, then it follows that S is continuous; since S 
is clearly one to one and S—'(x,v) = (x,x~1y), it follows similarly 
that J! is continuous, and hence that S is a homeomorphism. 
If, conversely, it is known that S is a homeomorphism, then S~! 
is continuous and, therefore, so is the transformation S~! followed 
by projection on the second coordinate. (In case X is the real line, 
the transformation S is easy to visualize; its effect is that of a 
shearing which moves every point in the plane vertically by an 
amount equal to its distance from the y-axis.) 

Motivated by the preceding paragraph and the fact that every 
locally compact topological group has a Haar measure, we define 
the following measure theoretic analog of the concept of a topo- 
logical group. A measurable group is a o-finite measure space 
(X,S,u) such that (a) uw is not identically zero, (b) X is a group, 
(c) the o-ring S and the measure u are invariant under left trans- 
lations, and (d) the transformation S of X X X onto itself, 
defined by S(x,y) = («,xy), is measurability preserving. (To say 
that S is invariant under left translations means, of course, that 
xE eS for every x in X and every E in S; by a measurable subset 
of X X X we mean, as always, a set in the o-ring S X S.) 

If X is a locally compact group, S is the class of all Baire sets 
in X, and » is a Haar measure, then the fact that S is a homeo- 
morphism (and therefore Baire measurability preserving), to- 
gether with the fact (51.E) that the class of all Baire sets in X X X 
coincides with S X S, implies that (X,S,u) is a measurable group. 
The main purpose of the following discussion of measurable groups 
is to see how much one can say about a locally compact topological 
group by exploiting its measure theoretic structure only. 
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If X is any measurable space (and hence, in particular, if X 
is any measurable group), then the one to one transformation 
R of X X X onto itself, defined by R(«,y) = (y,x), is measura- 
bility preserving—the reason for this is the immediately verifiable 
fact that if E is a measurable rectangle, then so also are R(E) 
and R7'(E)(= R(E)). Since the product of measurability preserv- 
ing transformations is measurability preserving, this remark gives 
us a large stock of measurability preserving transformations in a 
measurable group—namely all transformations which may be 
obtained by multiplying powers of S and R. We shall in particu- 
lar frequently use, in addition to the shearing transformation S, 
its reflected analog T = R~'SR; we observe that T(x,y) = (yx,y). 

Throughout the remainder of this section we shall assume that 


p and » are two measures (possibly but not necessarily identi- 
cal) such that (X,S,u) and (X,S,v) are measurable groups, and 
R, S, and T are the measurability preserving transformations 
described in the preceding paragraphs. 
Theorem A. Jf E is any subset of X X X, then 
(S(E))2 = *E, and (T(E))" = yE" 
for every x and y in X. 
Proof. The result for S follows from the relation 
xsE(%,y) = xe(x,x~1y), 
together with the facts that y e (S(E£)), if and only if xsca(x,y) 
1, and x~'y e E, if and only if xz(x,«y) = 1. The proof for 
is similar. J 


wa il 


Theorem B. The transformations S and T are measure 
preserving transformations of the measure space (X X X, 
S XS, u X v) onto itself. 


Proof. If E is a measurable subset of X X X, then, by 
Fubini’s theorem and Theorem A, 


(u X »)(S(E)) = fr((S(D)2)duls) = fo(xE due) = 


ee f v(Ez)du(x) = (« X »)(E); 
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this establishes the measure preserving character of S. The result 
for T follows similarly by considering the sections (T(E))”. J 


Theorem C. Jf Q = SRS, then 

(Q(A4 X B)),. = x4 N BO, 
Ay ff yeB, 
0 if ye'B. 


Proof. We observe that Q(x,y) = (xy,y7) and that Q7' = Q. 
The first conclusion follows from the relation 


and 


(Q(4 x By = | 


xoaxsy (x,y) = xXaxe(x yy) = xea(y)xe(y™), 


together with the facts that y e(Q(4 X B)),-1 if and only if 
Xoaxs) (x,y) = llandy exd N B- if and only if x24(y)xe(y71) 
= 1. The second conclusion follows from the relation 


xocaxB)(*,9") = xaxe (xy ,y) = xay(*)xe(y), 


1 


together with the facts that x e(Q(4 X B))” if and only if 
xotaxe)(%,y") = 1, and that xe dy and y eB if and only if 
xav(*)xe(y) = 1. Of 


Theorem D. Jf 4 is a measurable subset of X [of positive 
measure), and y © X, then Ay is a measurable set [of positive 
measure| and A~ is a measurable set [of positive measure]. 
If f is a measurable function, A is a measurable set of positive 
measure, and, for every x in X, g(x) = f(x«—")/u(Ax), then g is 
measurable, 


Proof. The measurability of Ay follows by selecting any 
measurable set B containing y and observing that, according to 
Theorem C, Ay is a section of the measurable set Q(4 X B) 
(where Q = S“1RS). For the remainder of the proof we shall 
make use of the fact that Q is a measure preserving transforma- 
tion of (X x X, S XS, » X up) onto itself. It follows that if 
u(A) > 0, then, by Theorem C, 


0< WA)? = eX QA x A) = fue 0 Adu), 
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and hence, in particular, that x~'4 NM 47 is a measurable set 
of positive measure for at least one value of x. We have proved, 
in other words, that if 4 is a measurable set of positive measure, 
then there exists a measurable set B of positive measure such 
that B c 4~'. (This implies, in particular, that as soon as we 
will have proved that 47! is measurable, the result concerning 
the positiveness of »(4~!) will follow automatically.) Since 
Bc A™ implies that y"B Cy 147, and since u(y'B) = 
u(B), another application of our result yields the existence of a 
measurable set C of positive measure such that C Cc (y'B)" c 
(y14)-! = Ay. This settles all our assertions about Ay. 
To prove the measurability of 4~' we observe that it follows from 
Theorem C and what we have just proved that, if u(4) > 0, then 


{y: u((Q(4 X A))”) > 0} = Am. 


This proves that if u(4) > 0, then 4~' is measurable; if u(4) = 0, 
we may find a measurable set B of positive measure, disjoint from 
A, and deduce the measurability of 4~* from the relation 
A‘=(4UB)1?-B, 

What we have already proved implies that if Sx) = f(x), 
then / is measurable. If 4 and B are measurable sets, if f(y) = 
u((Q(4 X B))*), and if fo(y) = fo(y—*), then both fo and fo are 
measurable, and we have, by Theorem C, 

fo(y) = 2(Ay)xa(y). 


We have proved, in other words, that if 2(y) = u(4y), then A 
is measurable on every measurable set. and it follows that i has 
the same property. § 


Theorem E. Jf 4 and B are measurable sets of positive 
measure, then there exist measurable sets C, and Co of positive 
measure and elements x, ¥\, 2, aNd yo, such that 


x1C; a 4, nC; Cc B, Cox%_ C 4, Cayo cB. 


Proof. Since u(B) >0 implies that p(B) > 0, it follows 
that (u X u)(4 & Bo) = p(A)u(B) > 0. Theorem C implies 
that x'4 N B is measurable for every x in X and of positive 
measure for at least one x in X. If x; is such that n(m1 14 N B) 
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> 0, and if y, = e, then, for C, = x,~'4 1 B, we have .x,C, c 4 
and yC, C B. 

Applying this result to 47! and B- we may find Co, xo, Yo 
so that xCo C A and yoCy C Bo, and we may write Cp = Cy, 
x2 = x0 ,y2= Jo | 


Theorem F. Jf 4 and B are measurable sets and if f(x) = 
u(x14 1 B), then f is a measurable function and 


ffdu = WDu(B». 


Df g(x) = uxA AB), and if € < u(4) + w(B), then the set 
{x: g(x) < e} is measurable. 


The first half of this result is sometimes known as the average 
theorem. 

Proof. The first assertion follows from the fact that if, as 
before, Q = S~'RS, then Q is a measure preserving transforma- 
tion of (X X X,S XS, u X yu) onto itself and 


f(x) = u(Q(4 X B™)).). 


If f(x) = f(x), then it follows that fis measurable. The second 
assertion is a consequence of this and the relation 


{xz g(x) < e} = fx: f(x) > 2(4) + o(B)- 9}. 


(1) Is the Cartesian product of two measurable groups a measurable group? 

(2) If w is a Haar measure in a compact group X of cardinal number greater 
than that of the continuum, then (X,S,u) is not a measurable group. (Hint: 
let D = {(x,y): « = y} = S(X X {e}). If Dis in S XS, then there exists a 
countable class R of rectangles such that De S(R). Let E be the [countable] 
class of sides of rectangles belonging to R. Since De S(E) X S(E), it follows 
that every section of D belongs to S(E). Since, however, by 5.9c, S(E) has 
cardinal number not greater than that of the continuum, this contradicts the 
assumption on the cardinal number of X.) 

(3) If » is a Haar measure on a locally compact group X, then, for every 
Baire set E and every « in X, the vanishing of any one of the numbers, 


ME), m(xE), w(Ex), and w(E~), 


implies the vanishing of all others. 

(4) If (X,S,u) is a measurable group such that y is totally finite, and if 4 
is any measurable set such that u(x4 — 4) = 0 for every x in X, then either 
u(4) = 0 or p(X — A) = 0. (Hint: apply the average theorem to 4 and 


262 HAAR MEASURE [Sec. 60] 


X — 4.) Properly formulated, this result remains valid even if p is not neces- 
sarily finite; in the language of ergodic theory it asserts that a measurable group, 
considered as a group of measure preserving transformations on itself, is metri- 
cally transitive. 

(5) If w is a Haar measure on a compact group X, then, for every Baire set 
E and every x in X, 


MCE) = w(xE) = w(Ex) = w(E~). 


§ 60. UNIQUENESS 


Our purpose in this section is to prove that the measure in a 
measurable group is essentially unique. 


Theorem A. J[f yp and v are two measures such that (X,S,u) 
and (X,S,v) are measurable groups, and if E in S is such that 
0 < o(E) < &, then, for every non negative measurable function 


f on X, 
frcrduts) = we) f 2 ay) 


The important part of this result, as far as it concerns the 
uniqueness proof, is its qualitative aspect, which asserts that 
every u-integral may be expressed in terms of a »—integral. 

Proof. If g(y) = f( =) /v(Ey), then our results in the preced- 
ing section imply that g is a non negative measurable function 
along with f. If, as before, we write 


S(x,y) = (x,xy) and T(x,y) = (yxy), 


then in the measure space (X X X,S X S, u X ») both the trans- 
formations S and T are measure preserving and, therefore, so is 
the transformation S~'T. Since S~!T(x,y) = (yx,x71), it fol- 
lows from Fubini’s theorem that 


WE) fe(y)do(y) = fxeCmdu(s) fg(v)do(9) = 
= frase v)dlu x ») G9) = 
= [fxeodeea(»)du(x) = 


= f g(x) o(Ex)du(x). 
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Since g(x—)v(Ex~!) = f(x), the desired result follows from a 
comparison of the extreme terms of this chain of equalities. J 


Theorem B. Jf u and v are two measures such that (X,S,u) 
and (X,S,v) are measurable groups, and if E in S is such that 
0 < o(E) < &, then, for every F in S, w(E)vX(F) = »(E)p(F). 


We remark that this result is indeed a uniqueness theorem; 
it asserts that, for every F in S, u(F) = cv(F), where c is the non 
ra 
v(E) 
a multiplicative constant. 

Proof. Let f be the characteristic function of F. Since 
Theorem A is true, in particular, if the two measures » and » 
are both equal to », we have 


negative finite constant , i.e. that w and » coincide to within 


_ f(y) 
Jreane) = 1B) [Trey a). 


Multiplying by u(Z) and applying Theorem A, we obtain 


wl) [7s)dv(s) = (EB) ff)duls). 1 


Theorem C. Jf » and v are regular Haar measures on a 
locally compact topological group X, then there exists a positive 
finite constant c such that p(E) = cv(E) for every Borel set E. 


Proof. If So is the class of all Baire sets in X, then (X,So,u) 
and (X,So,v) are measurable groups and hence, by Theorem B, 
u(E) = cv(E) for every Baire set E, with a non negative finite 
constant c; the fact that ¢ is positive may be inferred by choosing 
E to be any bounded open Baire set. Any two regular Borel 
measures (such as yu and cv) which coincide on Baire sets coincide 
also on all Borel sets; (cf. 52.H). 


(1) The Haar measure of the multiplicative group of all non zero real numbers 
is absolutely continuous with respect to Lebesgue measure; what is its Radon— 
Nikodym derivative? 

(2) If X and Y are two locally compact groups with Haar measures pw and » 
respectively, and if \ is a Haar measure in X X Y, then, on the class of all Baire 
sets in X X Y,A is a constant multiple of up X ». 

(3) The metric transitivity established in 59.4 may be used to prove the 
uniqueness theorem for measurable groups with a finite measure. Suppose 
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first that « and p are left invariant measures such that y< yu; then there exists a 
non negative integrable function f such that 


o(E) =f fledu(x) 
E 
for every measurable set EZ. It follows that 
= = -1 (; 
(9B) =f fladdule) = ff faut), 


and hence, since v is left invariant, that f(x) = f(y~'x) [vl If M = 
{x: f(x) < ¢}, then 


By, — Ni) = (la f(y x) < t} — [es f(x) < 4}) = 0, 


and hence, for each real number f, either n(N,) = 0 or u(N,’) = 0. Since this 
implies that f is a constant a.e. [ul], it follows that »y = cu. To treat the general, 
not necessarily absolutely continuous, case, replace u by » +». Just as in 59.4, 
these considerations may be extended to apply to not necessarily finite cases 
also. 

(4) If (X,S,u) is a measurable group and if E and F are measurable sets, then 
there exist two sequences {xn} and { yn} of elements of X and a sequence {4,} 
of measurable sets such that (a) the sequences {xn4n} and { yn4n} are disjoint 
sequences of subsets of E and F respectively, and (b) at least one of the two 
measurable sets, 


Ey = E =, Unni xn4n and Fo oe F 4 Utes rndn, 


has measure zero. (Hint: if either E or F has measure zero, the assertion is 
trivial. If both E and F have positive measure, apply 59.E to find 4, 71, 1 
so that u(41) > 0, m41C E, ¥:4,C F. If either E — x:4, or F — 914; has 
measure zero, the assertion is true; if not, then 59.E may be applied again, and 
the argument may be repeated by countable but possibly transfinite induction.) 

Since this result is valid for all left invariant measures, it may be used to 
give still another proof of the uniqueness theorem. It may be shown that if 
u and » are both left invariant measures, then the correspondence which, for 
every measurable set £, assigns u(Z) to v(E), is an unambiguously defined, one 
to one correspondence between the set of all values of u and the set of all values 
of vy. A more detailed, but not particularly difficult, examination of this corre- 
spondence yields the uniqueness theorem. 

(5) Let » be a regular Haar measure on a locally compact group X. Since, 
for each x in X, the set function pz, defined for every Borel set E by u(Z) = 
u(Ex), is also a regular Haar measure, it follows from the uniqueness theorem 
that u(Ex) = A(«)u(Z), where 0 < A(x) < 

(5a) A(xy) = A(x)A(y); A(e) = 1. 

(5b) If « is in the center of X, then A(x) = 1. 

(5c) If x is a commutator, and hence, more generally, if x is in the commutator 
subgroup of X, then A(x) = 1. 

(5d) The function A is continuous. (Hint: let C be a compact set of positive 
measure and let € be any positive number. By regularity, there exists a bounded 
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open set U such that CC U and n(U) S$ (1 + eu(C). If Y is a neighborhood 
of e such that V = V-' and CV C U, and if xe V, then 


A(x)u(C) = w(Cx) S w(U) S (1 + €u(C) 


and - 
a = w(Ce—) S$ n(U) S$ (1 + en), 
so that ~~ S ACs) s1+ée) 


(Se) The results (5a) and (5d) yield another proof of the identity of left and 
right invariant measures on a compact group X, since they imply that A(X) 
is a compact subgroup of the multiplicative oe of positive real numbers. 


(5f) For every Borel set FE, p(Z7!) = Sx NG Aa aes by the unique- 


ness theorem for right invariant measures, u(E-) = ¢ Ix NPS Aes MHC*) for some 


positive finite constant c. This implies that f Se) d(x) = ¢ a d(x) 


for every integrable function f. Replacing f(x) by f(x), writing g(<—4) = 
f(x7')/A(x), and applying the last written equation to g in place of f, yields 
the result that 


1 
= ele Daule) =e fale aule). 
(5g) If T(x) is the right handed analog of A(x), i.e. if, for a right invariant 


measure p, I is defined by »(xE) = I'(x)r(Z), then P(x) = x 

(6) A relatively invariant measure on a locally compact group X is a Baire 
measure v, not identically zero, such that for each fixed x in X the measure pz, 
defined by ».(Z) = v(xE), is a constant non zero multiple of ». A necessary 


and sufficient condition that p be relatively invariant is that v(EZ) = f o(y)du(y), 
E 


where « is Haar measure and @ is a continuous representation of X in the multi- 
plicative group of positive real numbers. (Hint: if ¢ is non negative, continuous, 


and such that d(xy) = (x)b(y), and if »(Z) =f o(v)du(>), then 
v(xE) = fo a)dey) =f ooy)an(y) = 


=f, o@)6(»)duty) = (4)r(E). 


If, conversely, v(x) = o(x)v(E), then it follows (cf. (5)) that (xy) = d(x)o(y) 
and ¢ is continuous. Consequently #(E£) = o(y—)dv(y) may be formed, 
EB 


and by the uniqueness theorem # = yp.) 

(7) If w is a o-finite, left invariant measure on the class So of Baire sets in a 
locally compact group X, then p is a constant multiple of the Baire contraction 
cf Haar measure, and hence, in particular, u is finite on compact sets. (Hint: 
if wis not identically zero, then (X,So,u) is a measurable group.) 


Chapter XII 


MEASURE AND TOPOLOGY IN GROUPS 


§61. TOPOLOGY IN TERMS OF MEASURE 


In the preceding chapter we showed that in every locally com- 
pact group it is possible to introduce a left invariant Baire measure 
(or a left invariant, regular Borel measure) in an essentially 
unique manner. In this chapter we shall show that there are 
very close connections between the measure theoretic and the 
topological structures of such a group. In particular, in this 
section we shall establish some of the many results whose total 
effect is the assertion that not only is the measure determined by 
the topology, but that, conversely, all topological concepts may 
be described in measure theoretic terms. Throughout this sec- 
tion we shall assume that 


X is a locally compact topological group, u is a regular Haar 
measure on X, and p(E,F) = u(E A F) for any two Borel sets 
E and F. 


Theorem A. Jf E is a Borel set of finite measure and if 
F(x) = o(xE,E), for every x in X, then f is continuous. 


Proof. If « > 0, then, because of the regularity of wu, there 


; € : 
exists a compact set C such that p(E£,C) < ri and there exists an 


open Borel set U containing C such that p(U,C) < = Let V be 


a neighborhood of e such that ¥ = Vand VC CU. IfyxeV, 
266 
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then «~y e V and therefore 
e(xC,yC) = w(x — yC) + u(y — xC) = 
= wy ek — C) + ule “yO -C)s 


SwWVC-C)SsmwWU-Q< 
It follows that 
| p(xE,E) ~ p(yE,E)| S 
S p(xE,yE) S p(xE,xC) + p(xCyC) + p(yCyE) <e | 


Theorem A implies that for every Borel set E of finite measure, 
and for every positive number ¢, the set {x: p(xE,E) < e} is 
open. Our next result shows that there are enough open sets of 
this kind. 


Theorem B. If U is any neighborhood of e, then there exist 
a Baire set E of positive, finite measure and a positive number 
e such that {x: p(xE,E) < e} CU. 


Proof. Let VY be a neighborhood of e such that VY-! c U 
and let E be a Baire set of positive, finite measure such that 
EcV. If eis such that 0 < € < 2u(£), then 


{x: p(xE,E) < e} C{xrxE NE #0} = EE?CVV'CU. J 


Theorems A and B together imply that the class of all sets of 
the form {x: p(xE,E) < e} is a base at e, and hence that it is 
indeed possible to describe all topological concepts in measure 
theoretic terms. To illustrate in detail how such descriptions are 
made, we proceed to give a measure theoretic characterization 


of boundedness. 


Theorem C. 4 necessary and sufficient condition that a set 
A be bounded is that there exist a Baire set E of positive, finite 
measure and a number ¢,0 S € < 2n(E), such that 


A C {x: p(«E,E) S e}. 


Proof. In order to prove the sufficiency of the condition, we 
shall show that if E is a Baire set of positive, finite measure, and 
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if OS € < 2u(E), then the set {x: p(xE,E) S e} is bounded. 
Let 5 be a positive number such that 45 < 2u(E) — e¢, and let 
C be a compact subset of E such that u(Z) — 6 < w(C). It fol- 
lows that 


p(xC,C) S p(xCyxE) + p(xE,E) + p(E,C) < 25 + p(xE,E) 
and hence that 
{x: p(xE,E) S e} C {x: p(xC,C) S € + 28}. 
Since € + 25 < 2y(C), it follows that 
{x: p(xE,E) S e} C {x: u(xC NC) #0} CCC. 


To prove the necessity of the condition, let C be a compact set 
such that 4 C C and let D be a compact set of positive measure. 
Suppose that the Baire set E of positive, finite measure is selected 
so that E> C'1DUD. Since DCE, and since, for x in C, 
DcxC "DcrE, it follows that, for x in C, DOC *ENE. 
This implies that 


ACCC {x: DCxEN E} c {x: p(xE,E) S ¢}, 
where € = 2(u(E) — w(D)). 


(1) The analogs of Theorems A, B, and C with n(xE MN F) in place of p(xE,E) 
are true, where E and F are Baire sets of positive finite measure. 

(2) If, for a fixed Borel set E of finite measure, f(x) = xE, then f is a con- 
tinuous function from X to the metric space of measurable sets of finite measure. 

(3) If E is any Borel set of positive measure, then there exists a neighborhood 
U of e such that UC EE“. 

(4) X is separable if and only if the metric space of measurable sets of finite 
measure is separable. 

(5) If Eis any bounded Borel set, and if, for every x in X and every bounded 
neighborhood U of e, 

u(E N) Ux) 


Su(«) = Ue 


then fy converges in the mean (and therefore in measure) to xz as U > e. 
In other words, for every positive number € there exists a bounded neighborhood 


V of e such that if U C PV, then {| fu — xe |du <e. This result may be called 
the density theorem for topological groups. (Hint: let V be a neighborhood of 


@such that ify e V, then p(yZ,ZE) < : . IfUC Vand if Fis any Borel set, then 
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27 WO) SS | xz(yx) — xe(x) |du(x)du(y) 2 


=| J du(x) S egy tee 9)Ma(9) — — f xa(e rants) f° a = 


= If Yule) — xe() a(x) |. 


u(A) _ u(Ax) 
[Recall that "B) = T(Bs) 
The desired conclusion follows upon applying this result first to 


= {x: fu(x) — xe(x) > 0} 


and then to F = {x:fu(x) — xz(x) < 0}.) 

(6) If » is any finite signed measure on the class of all Baire sets in X, then 
(cf. 17.3) there exists a Baire set N, such that »(EZ) = (EN N,) for every 
Baire set E. If and v are any two such finite signed measures, their convolution 
\ * py is defined, for every Baire set E, by 


for any Borel sets 4 and B and every x in X; cf. 60.5.] 


A *v)(Z) = xE(xy)d(A X v) (x, 9). 


N\XNy 


If X and » are the indefinite integrals (with respect to Haar measure pz) of the 
integrable functions f and g respectively, then \ # v is the indefinite integral of 
A, where 


(9) = [fee ) dul). 
(7) Ifd and » are finite signed measures (cf. (6)), then 
Ox )(E) =f ve tBAN(e), 
If the group X is abelian, thenXN* y = v#A. 


(8) If X and » are finite measures on the class of all Baire sets of a locally 
compact, o-compact, abelian group X, then fA(eE)av(x) = foe —)dr(x). 


(Hint: if (EZ) = »(E7), then 
f AxE)dv(x) = f AMxE\dr(x) and [r(xE—)dr(x) = f B(x-1E)dN(x); 


the desired result follows from the relation A * } = D* d.) 
(9) If f and g are two bounded, continuous, monotone functions on the real 
line, then (cf. 25.4) 


[fee + [[2df = £020) - S@s@, 
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i.e. the usual equation for integration by parts is valid. (Hint: let \ and v be 
the measures induced by f and g respectively, and apply (8) with E = 
{x: -0 <* <0}, 


§62. WEIL TOPOLOGY 


We have seen that every locally compact group in which measur- 
ability is interpreted in the sense of Baire is a measurable group, 
and that, moreover, the topology of the group is uniquely deter- 
mined by its measure theoretic structure. In this section we shall 
treat the converse problem: is it possible to introduce a natural 
topology in a measurable group so that it becomes a locally com- 
pact topological group? We shall see that the answer is essentially 
affirmative; we proceed to the precise description of the details. 

Throughout this section we shall work with a fixed measurable 
group (X,S,u); as usual, we shall write p(Z,F) = p(E AF) for 
any two measurable sets E and F. We shall denote by A the 
class of all sets of the form EE, where E is a measurable set of 
positive, finite measure, and by N the class of all sets of the form 
{x: p(xE,E) < e}, where E is a measurable set of positive, finite 
measure and e is a real number such that 0 < ¢ < 2y(E). 


Theorem A. Jf N = {x: p(xE,E) < e} © N, then every 
measurable set F of positive measure contains a measurable 
subset G of positive, finite measure such that GG" CN. 


Proof. It is sufficient to treat the case in which F has finite 
measure. If T(x,y) = (yx,y), then T(E X F) is a measurable 
set of finite measure in X X X and hence there exists a set 4 
in X X X such that 4 is a finite union of measurable rectangles 
and 


qu) > e(T(E X F),A) = 


= fi if | xrcexm (9) — xaleyy) [du(x)du(y) 2 


> f f | xe(y7) — xaxyy) [du(x)du(y). 
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If we write C = {>:fl xe(y— x) — xa(x,y) |du(x) = ‘| , then 
it follows that u(F N C) S gu(F) and hence that 
u(F — C) 2 gu(F) > 0. 
If y e F — C, then 


pyB,4") = fi xe(y) — xalsyy) Iduls) < §- 


Since J is a finite union of measurable rectangles, there are only 
a finite number of distinct sets of the form 4%; we denote them 
by 4, -+:, 4n. What we have proved may now be expressed 
by the relation 


F—Cc Uf: {: p(yE,A;) < <I. 
Since ;< u(E) = u(y) it follows from 59.F that each of the 


sets | y: p(yE Ai) < ‘| is measurable, and therefore, since 


u(F — C) > 0, that at least one of them intersects F — C in a 
set of positive measure. We select an index i such that if 


G=(F-oO)n | 9: oy, < |, 


then u(Gp) > 0. Itis clear that Go is a measurable set of positive, 
finite measure and that Gp C F. If y, e Gp! and ye e Go, then 
p(y1¥2 EE) = p(y" Ey 7£) S 
S p(y2£,A;) + p(y 1£,A) <6, 

so that Go-!Gy C N. We have proved, in other words, the 
existence of a set Gy satisfying all the requirements of the theorem 
except that instead of GG-! c N we have Gy 1G, C N. If we 
apply this result to Ff’ in place of F, and if we denote the set so 
obtained by G~}, then the set G satisfies all the requirements 
without exception. J 

Theorem A asserts in particular that every set in N contains a 
set in A. We shall also need the following result which goes in 
the opposite direction. 


272 MEASURE AND TOPOLOGY IN GROUPS (Sec. 62] 
Theorem B. [ff 4 = EE ec Aand0 << 2y(£), and if 
N = {x: p(xE,E) < ¢}, 
then NeN andNCA. 


Proof. It is trivial that Ne N; to see that N Cc J, observe 
that Nc {x:xENE #0} = EE § 
Theorem C. If N = {x: p(xE,E) < e} © N, then N is a 
measurable set of positive measure. If p(E~*) < @, then 


w(N) < ». 
Proof: Since = (x: u(xE 1 E) > u(E) — ‘|, fiegaetece: 


ability of N follows from 59.F. To prove that p(N) > 0, we 
apply Theorem A. If G is a measurable set of positive measure 
such that GG"! C N, then, in particular, Gy“! C N for any y 
in G. The last assertion of the theorem follows from the relations 


(5 = <) u(N) S [wee N E)du(x) S$ 


< f u(xE  E)du(x) = w(E)\u(E“). 


Theorem D. If 4 and B are any two sets of A, then there 
exists a setéCin A such thatCC ANB. 


Proof. Let E and F be measurable sets of positive, finite 
measure such that 4 = EE and B = FF-}. By 59.E there 
exists a measurable set G of positive, finite measure and there 
exist two elements x and y in X such that 


Gx CE and GycF. 
If C = GG, then C ce A and 
C = (Gx)(Gx) 1 CA and C= (Gy)\(GyItcB. J 


Before introducing the promised topology in X, we need to 
define one more concept. We recall that our definition of a 
measurable group was motivated by the continuity properties of 
topological groups and ignored entirely the separation axiom 
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which is an essential part of the definition of a topological group. 
One way of phrasing the relevant separation axiom is this: if an 
element x of the group is different from e, then there exists a 
neighborhood U of e such that x e’ U. Guided by these con- 
siderations and 61.A and 61.B, we shall say that a measurable 
group X is separated if whenever an element x of the group is 
different from e, then there exists a measurable set E of positive, 
finite measure such that p(xZ,E) > 0. 


Theorem E. Jf X is separated, and if the class N is taken 
for a base at e, then, with respect to the induced topology, X 
ts a topological group. 


We shall refer to this topology of the measurable group X as 
the Weil topology. 


Proof. We shall verify that N satisfies the conditions (a), 
(b), (c), (d), and (e) of § 0. 

Suppose that xo is an element of X, xo ¥ e, and that E is a 
measurable set of positive, finite measure such that p(xoE,E) > 0. 
If € is a positive number such that 0 < € < p(xoF,E), then 
e<2u(Z). It follows that if N = {x:p(xE,E) < e}, then 
N eN, and, clearly, xo e’ N. 

If N and M are in N, then by Theorem A, there exist sets 4 
and Bin A such that 4 c Nand Bc M. By Theorem D, there 
exists a set C in A such that Cc 4 N B; by an application of 
Theorem B, we obtain a set K in N such that 


KcCcANBCNNM. 
If N = {x: p(xE,E) < e}, we write M = {¥: o(xE,E) < <. 


If x9 and yo are any two elements of M, then 
(xo. yo EE) s p(yo 'E,E) or p(%o*E,£) a 
= e(¥0E,E) + p(xoE,£) < €, 


and therefore VM c N. 
If N ¢ N and x e X, then by Theorem A, there exists a measur- 
able set E of positive, finite measure such that EE“' CN. 
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Applying Theorem B to the set (xE)(x£)—! in A, we may find a 
set M in N such that 


M Cc («xE)(xE)7 = x«EEx 7 © xNxO,. 


If, finally, N = {x:p(xE,E) < «} eN and if xo eN, then 
p(xoE,E) <«. Since e < 2u(Z), it follows that e — p(xoE,E) < 
2u(xoE) and hence that if 


M = {x: p(xxoE,xoE) < € — p(moE,E)}, 
then MeN. Since 
Nxo~! = {xxo7!: p(xE,E) < e} = {x: p(xxoE,E) < e}, 
we have, for every x in M, 
p(xxpE,E) S p(xxoEyxoE) + p(xoE,E) < 
< (e — p(%oE,E)) + p(xoF,E) = «. 
This implies that x e Nxo~! and hence that Mxo CN. J 


Theorem F. Jf X is a separated, measurable group, then X 
is locally bounded with respect to its Weil topology. If a 
measurable set FE. has a non empty interior, then p(E) > 0; 
if a measurable set E is bounded, then p(E) < o. 


Proof. Let No be an arbitrary set of finite measure in N 
(see Theorem C), and let My be a set in N such that MypMy7 C No. 
We shall prove that M, is bounded. If Mp is not bounded, then 
there exists a set N in N and a sequence {x,} of elements in Mo 
such that 

Xn41 e’ U?-1 «aN, i= 1, 2, eeeres, 


By Theorem A, there exists a measurable set E of positive, finite 
measure such that FE ¢ M,~! and EE-!c WN. Since the condi- 
tion on {x,} implies that the sequence {x,E} is disjoint, and 
since *nE Cc MypMy—! Cc No, it follows that p(No) = ~. Since 
this contradicts Theorem C, we have proved the first assertion 
of the theorem. 

The fact that a measurable set with a non empty interior has 
positive measure follows from Theorem C; the last assertion is 
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a consequence of Theorem C and the fact that, by definition, a 
bounded set may be covered by a finite number of left transla- 
tions of any setin N. J 

Theorem F is in a certain sense the best possible result in this 
direction. If, however, we make use of the fact that every 
locally bounded group may be viewed as a dense subgroup of a 
locally compact group, then we may reformulate our results in 
a somewhat more useful way. We do this in Theorem H;; first 
we prove an auxiliary result concerning arbitrary (i.e. not neces- 
sarily right or left invariant) Baire measures in locally compact 
groups. 


Theorem G. Jf wis any Baire measure in a locally compact 
topological group X, and if Y is the set of all those elements y for 
which w(yE) = wl) for all Baire sets E, then Y is a closed 
subgroup of X. 


Proof. The fact that Y is a subgroup of X is trivial. To prove 
that Y is closed, let yo be any fixed element of Y and let C be 
any compact Baire set. If U is any open Baire set such that 
yoC c U, then there exists a neighborhood V of e such that 
VyoC c U. Since Vyo is a neighborhood of yo, it follows that 
there exists an element y in Y such that ye Vy. Since yC Cc 
VyoC < U, it follows that 


u(C) = w(yC) S p(U), 


and hence, by the regularity of w, that u(C) S u(yoC). Applying 
this conclusion to yo~! and yoC in place of yo and C we obtain the 
reverse inequality, and hence the identity, u(C) = u(yoC) for all 
C. It follows that u(E) = u(yoF) for every Baire set E and hence 
that yc Y. ff 

By a thick subgroup of a measurable group we mean a sub- 
group which is a thick set; (cf. § 17). 


Theorem H. Jf (X,S,u) is a separated, measurable group, 
then there exists a locally compact topological group X with a 
Haar measure fi on the class § of all Baire sets, such that X is a 
thick subgroup of X, S28 NX, and w(E) = AE) whenever 
EcSandE=ENX. 
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Proof. Let X be the completion of X in its Weil topology; 
i.e. X is a locally compact group containing X as a dense subgroup. 
Consider the class of all those subsets £ of X for which E N X eS. 
It is clear that this class is a o-ring; in order to show that this 
o-ring contains all Baire sets, we shall show that it contains a base 
for the topology of X. 

Suppose that # is any point of X and U is any neighborhood 
of é in X; let 7 be a neighborhood of é such that PP? c U. 
Since ? N X is an open set in X, there exists a measurable open set 
W in X such that Wc P NX; since the topology of X is (by 
the definition of X) the relative topology it inherits from X, there 
exists an open set W in X such that VW = WN X. Since we may 
replace W by W N P, there is no loss of generality in assuming 
that ¥ <P. Since X is dense in X, there exists a point x in X 
such that x e £4; it follows that 


exh CAWOW C&POP & 20. 
If we define a by writing a(Z) = n(E N X) for every E in 8, 


then it is easy to verify that @ is a Baire measure in X. It follows 
from Theorem G, and the fact that a(x£) = a(£) whenever 
xeXand E c§, that fis left invariant. The uniqueness theorem 
implies therefore that @ coincides on § with a Haar measure in X. 
It follows that if £eS and EN X = 0, then a(£) = w(E N X) 
= 0,ie. that X is thickin X. J 


(1) Let X be any locally compact topological group with a Haar measure pu 
on the class S of all Baire sets. If ¥ = X X_X,ifS isthe class of all sets of the 
form E XX, where Ee §, and if f(E X X) = u(E), then (X,8,z) is a measur- 
able group which is not separated. To what extent is this example typical of 
non separated measurable groups in general? 

(2) If X is a separated measurable group, then a set £ is bounded with respect 
to the Weil topology of X if and only if there exists a measurable set 4 of posi- 
tive, finite measure such that EZ is contained in a measurable set of finite 
measure. 

(3) Is Theorem G true for Borel measures? 

(4) Is the subgroup Y described in Theorem G necessarily invariant? 

(5) Under the hypotheses of Theorem G, write f(x) = u(xE) for every x in 
X and every Baire set E. Is f a continuous function? 

(6) The purpose of the following considerations is to give a non trivial example 
of a thick subgroup. Let X be the real line and consider the locally compact 
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topological group X X X. A subset B of X is linearly independent if the 
conditions "Pirie; = 0, x:¢ B, i= 1, +++, n, with rational numbers 1;, 
imply that ry) =---=r, = 0. 

(6a) If E is a Borel set of positive measure in X X X and if B is a linearly 
independent set in X, of cardinal number smaller than that of the continuum, 
then there exists a point (x,y) in E such that B U {x} is a linearly independent 
set. (Hint: there exists a value of y such that E¥ has positive measure and 
therefore has the cardinal number of the continuum.) 

(6b) There exists a set C of points in X X X such that (i) CN E 0 for 
every Borel set E of positive measure in X X X, (ii) the set B of first coordinates 
of points of C is linearly independent, and (iii) C intersects every vertical line 
in at most one point. (Hint: well order the class of all Borel sets of positive 
measure in X X X and construct C, using (6a), by transfinite induction.) 

(6c) A Hamel basis is a linearly independent set B in X with the property 
that for every x in X there exists a finite subset {1, ---, *n} of B and a corre- 
sponding finite set {71, ++, rn} of rational numbers such that « = )°fly rexi. 
The expression of x as a rational linear combination of elements of B is unique. 
Every linearly independent set is contained in a Hamel basis. (Hint: use 
transfinite induction or Zorn’s lemma.) 

(6d) By (6b) and (6c) there exists a set C of points in X X X having the 
properties (i), (ii), and (iii) described in (6b) and such that the set B of first 
coordinates of points of Cis a Hamel basis. If « = }°?_, rix:, where 7; is rational 
and (xv) eG i= L «+m write fle) = Ltanye MZ = (Gy):7 =f) 
(i.e. if Z is the graph off), then Z is a thick subgroup of X X X. 


§63. QUOTIENT GROUPS 
Throughout this section we shall assume that 


X is a locally compact topological group and y» is a Haar 
measure in X; Y is a compact, invariant subgroup of X, 
v is a Haar measure in Y such that »(Y) = 1, and wis the pro- 
jection from X onto the quotient group X = X/Y. 


While most of the important results of this section are valid for 
closed (but not necessarily compact) subgroups Y, we restrict our 
attention to the compact case because this will be sufficient for 
our purposes and because the proofs in this case are slightly 
simpler. 


Theorem A. If a compact set C is a union of cosets of Y 
and if U is an open set containing C, then there exists an open 
set P in X such that 


Caert(Acu. 
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Proof. There is no loss of generality in assuming that U is 
bounded. If we write X) = UY, then it follows that Xo is com- 
pact. We assert that Xp is, just as UY, a union of cosets of Y. 
To prove this, suppose that x, e Xo and a(*) = (x2) (so that 
x1 1x2 e Y); we are to prove that x, © Xo. If V is any neighbor- 
hood of x2, then Vx2~1!x, is a neighborhood of x, and therefore 
UY 1 Vx.7'x, # 0. Since x; ~!xe € Y, it follows that 


UY NV = UY x,7*x. N Veg xx, xe = 
= (UY al Vr" x4) 1 Xe ad 0; 


since V is arbitrary, this implies that xg e Xo. 

The fact that Cis a union of cosets of Y implies that r(X>o — U) 
N (C) = r((Xp — U) NC) =O. Since the sets r(Xp — U) and 
7(C) are compact, and since 7(U) is an open set containing 7(C), 
there exists an open set ? in X such that 


m(C) CP Cxr(U) Cr(X) and PN xr(X — UV) =0. 
If xex—(P), so that (x) eP, then x(x) e r(Xo — U) and 


therefore x e’ X) — U. Since, however, x ¢ Xo, it follows that 
xe U and therefore that Cc m7(7) CU. § 


Theorem B. If C is a compact subset of X, then x—(C) 
is a compact subset of X; if E is a Baire set [or a Borel set] in 
X, then w—(£) is a Baire set or a Borel set\ in X. 


Proof. Suppose that K is an open covering of r~1(€). Since, 
for each ¥ in C, r—1({#}) is a coset of Y, and therefore compact, 
it follows that K contains a finite subclass K(#) such that 
ax ({#}) c U(#) = U K(&). By Theorem A there is an open 
set P(#) in X such that 


a({8}) © V(8) = x7 (P(8)) © US). 


Since C is compact, there exists a finite subset {4 ---, #n} of C 
such that C c UJ}., P(4,); it follows that 


aC) c UR Ya) C U2 UK), 
and hence that +~!(C) is compact. 


The assertion concerning Baire sets and Borel sets follows from 
the preceding paragraph and the additional facts that the inverse 
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image (under 7) of a Gs is a Gy, and that the class of all those sets 
in X whose inverse image lies in a prescribed o-ring is a o-ring. 

Theorem B implies that if measurability in both X and X is 
interpreted in the sense of Baire, or else in the sense of Borel, 
then the transformation w is measurable. In other words, m7! 
maps measurable sets satisfactorily; how does it map their meas- 
ures? 


Theorem C. Jf @ = ur, then pis a Haar measure in X. 


Proof. The fact that f is finite on compact sets and positive 
on non empty open Borel sets follows from the fact that the 
inverse image (under 7) of a compact set or a non empty open set 
is a compact set or a non empty open set, respectively. It remains 
only to prove that @ is left invariant. 

If £ is a Borel set in X and % &X, let xo be an element of X 
such that r(xo) = %. If « exon (EL), then (since 7 is a homo- 
morphism) x(x) © Sof, so that 


xom (LE) Cr (AB). 


If, conversely, x ew—?(%o£), then w(x) © 49 and therefore 1(xp~1x) 


= fy—14(x) eB. This implies that xo~'x e 7~1(£) and hence that 


x exor—(E). Since we have thus proved that 


= (So) Cc xo! (£), 


it follows that 
AoE) = pr (SL) = u(mom(£)) = pw (£) = a(Z). 
Theorem D. If f © &(X) and if 


glx) = f Slay)dv(y), 


then ge &4(X) and there exists a (uniquely determined) func- 
tion & in &4(X) such that g = gn. 


Proof. If /.(y) =/(xy), then the continuity of / implies the 
continuity, and hence integrability, of f, on Y. Since f is uni- 
formly continuous, to every positive number ¢ there corresponds a 
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neighborhood U of e such that if «1x2 ~! © U, then | f(x1) — f(x2) { 
<e. If x«,xg7! e U, then 


(*1.y) (x2)? = xyx27! © U 
and therefore 


| g(x) — g(x.) | S f fen) — f(xy) |d(y) <6 


so that gis continuous. Since g is clearly non negative and since 
{x: g(x) 0} c {x: f(x) ~ 0} -Y, it follows that ge £,(X). 

If w(x.) = w(xe), then x,~!xg e Y and it follows from the left 
invariance of » that 


g(w1) = f fley)dr(y) = f Sx (x1—!xey))do(y) = g(x). 


Consequently writing £(%) = g(x), whenever # = r(x), unam- 
biguously defines a function § on X; clearly g = gx. Since 
(39.A), for every open subset M of the real line, 


{#: 8(@) eM} = a({x: g(x) e M}), 


the continuity of $ follows from the openness of 7. Since x maps 
the bounded set {x: g(x) ~ 0} on a bounded subset of X, it 
follows that £ e £,(X); the uniqueness of % is a consequence of 
the fact that x maps X onto X. J 
Theorem E. If C is a compact Baire set in X and if 
g(x) = v(x C 0 Y), then there exists a (uniquely determined) 
Baire measurable and integrable function & on X such that 


g= fr. If Cis a union of cosets of Y, then f bdp = y(C). 


Proof. Let {f,} be a decreasing sequence of functions in 
£4.(X) such that lim, f,(*) = xe(«) for every x in X. If 


En(x) = f falxy)dr(y), 2 = 1,2, °°, 


then {g,} is a decreasing sequence of functions in £,(X) (cf 
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Theorem D), and hence (by, for instance, the bounded con- 
vergence theorem) 


lima ga) = f xe(xy)dr(y) = I tel y)dv(y) = 


= (eC NY) = g(x) 


for every x in X. 

By Theorem D, for each positive integer m there is a function 
&, in £4(X) such that g, = gar. Since the sequence {£,} is 
decreasing, we may write £(#) = lim, £,(%); clearly g = éx. 
Since (39.C) 


feda = fsdu = foet0 0 ¥)du(s), 


and since {x: v(x 1C NY) ~ 0} c {x:x 7C N Y #0} = CY, the 
integrability of g follows from the finiteness of ». 


If, finally, C is a union of cosets of Y, then 
Y if xeC 

as | = > 

es fe if xe’C, 


and therefore 
eda = fiedu = free 0 Yydu(s) = w(C). HI 


Theorem F. [f, for each Baire set E in X, 
ge(x) = vee TEN Y), 


then there exists a (uniquely determined) Baire measurable 
function $2 on X such that Se = Ser. 


Proof. We observe first that (by the definition of topology in 
Y) the set x-'E N Yis always a Baire set in Y and, consequently, 
that ge(x) is always defined. 

If we denote by E the class of all those sets E for which the 
desired conclusion is valid, then, by Theorem E, it follows that 
every compact Baire set belongs to E. Since the elementary 
properties of the (finite) measure v imply that E is closed under 
the formation of proper differences, finite disjoint unions, and 
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monotone unions and intersections, it follows that E contains all 
Baire sets. J 


Theorem G. If, for each Baire set E in X, &g is the unique 
Baire measurable function on X for which 


&u(a(x)) = v(x TE 1 Y) = ge(x) 


for every x in X, then 


feeda = w) 


for every Baire set E. 


Proof. Write \(E) = f eda = f PO OE Wi Vidioy fonevery 


Baire set Ein X. Since \(C) is finite for every compact Baire set C 
(Theorem E) and since } is clearly non negative, we see that Nisa 
Baire measure in X. If xg © X, then 


(oH) = fgaele)du(x) = f y(e—lxoE N Y)du(x) = 
= [v((%o2) AE 01 Y)dul) = fige( 0) dul) = 


eZ f ge(x)du(x) = (E), 


so that dis left invariant. It follows from the uniqueness theorem 
that \(E) = cy(£) fora suitable constant c. Since, by Theorem E, 
d(C) = u(C) whenever C is a compact Baire set which is a union of 
cosets of Y, and since there exist such sets with u(C) > 0, it follows 
thatc=1. J 


§ 64. THE REGULARITY OF HAAR MEASURE 


The purpose of this section is to prove that every Haar measure 
is regular. Throughout this section, up to the statement of the 
final, general result, we shall assume that 
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X is a locally compact and o-compact topological group, 
and up is a left invariant Baire measure in X which is not 
identically zero (and which, therefore, is positive on all non 
empty open Baire sets). 


It is convenient to introduce an auxiliary concept; by an invariant 
o-ring we shall mean a o-ring T of Baire sets such that if E eT 
and x eX, then xE eT. Since the class of all Baire sets is an 
invariant o-ring, and since the intersection of any collection of 
invariant o-rings is itself an invariant o-ring, we may define the 
invariant o-ring generated by any class E of Baire sets as the 
intersection of all invariant o-rings containing E. 


Theorem A. Jf E is a class of Baire sets and T is the 
invariant o-ring generated by E, then T coincides with the 
o-ring To generated by the class {xE:x eX, E cE}. 


Proof. Since «EZ eT for every x in X and every E in E, it 
follows that Ty CT; it is sufficient therefore to prove that To 
is invariant. Let xo be any fixed element of X. The class of all 
those Baire sets F for which xof © To is a o-ring; since, for every 
x in X and every E in E, xo(xE) = (xox)E € To, it follows that 
this o-ring contains Ty. We have proved, in other words, that if 
FeTo, then xf eT, § 


Theorem B. Jf E is a countable class of Baire sets of finite 
measure and T is the invariant o-ring generated by E, then the 
metric space 3 of all sets of finite measure in T (with the metric 
p defined by p(E,F) = p(EA F)) is separable. 


Proof. Since every subspace of a separable metric space is 
separable, it is sufficient to prove that there exists a o-ring To 
of Baire sets such that T C Ty and such that Ty has a countable 
set of generators of finite measure; (cf. 40.B). Since X is a Baire 
set, it follows that X X E is a Baire set in X X X for each E 
in E. If we write, as before, S(x,y) = (x,xy), then S(X X £) is 
also a Baire set in X X X for each E in E. Consequently there 
exists, for each E in E, a countable class Rz of rectangles of finite 
measure such that S(X X E) eS(Rg). If we denote by Ty the 
o-ring generated by the class of all sides of all rectangles in all 
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Rg, E cE, then clearly 
S§ (X x E) e To x To 


for every Ein E. Since every section of a set in Ty X To belongs 
to To, it follows that, for every x in X and every E in E, 


xE = x(X X E), = (S(X X E))z € To, 
and hence (by Theorem A) that TC Tp. Jf 


Theorem C. Jf T is an invariant o-ring, if f is a function 
in & which is measurable (T), and if y in X is such that 
p(yE,E) = 0 for every E in T, then f(y—'x) = f(x) for every 
xin X, 


Proof. If E is any set of finite measure in T, then 
0 = ol yE,E) = fil xv(e) — xe(%) Idu(e) = 
= fixe) = xe(0) [du). 
It follows that 
fie) - g@) [du = 0 


for every integrable simple function g which is measurable (T), 
and hence, by approximation, that f | f(y x) — f(x) |du(x) = 0. 
Since the integrand of the last written integral belongs to £4, 
the desired conclusion follows from 55.B. § 


Theorem D. Jf T is an invariant o-ring generated by its 
sets of finite measure and containing at least one bounded set of 
positive measure, if E is a class of sets dense in the metric space 
of sets of finite measure in T, and if 


Y = {y: p(yE,E) = 0, E © E}, 
then Y is a compact, invariant subgroup of X. 


Proof. If Yo = { y:p(yE,E) = 0, E eT}, then clearly Yo C Y. 
On the other hand if Ep is a set of finite measure in T and eis a 
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positive number, then there exists a set EZ in E such that 


p(Eo,E) < oe It follows that if y e Y, then 


0 S p(yE,Eo) S p(yEo,yE) + o( yE,E) + p(E,Eo) < «. 


Since ¢ is arbitrary, this implies that y e Yo, and hence that 
Y = Yo. 
If y,; and ye are in Y and E is in T, then 


0 S o(y1 ~y2F,E) S p(y. *y2F, yeE) + o(yek,£). 


Since yo eT and p(y, yok, yeE) = p(yoE yi yek), it follows 
that y,~'y2 © Y, so that Y is indeed a subgroup of X. If y e Y, 
xeX, and EeT, then xE eT and therefore p(x—yxZ,E) = 
p(yxE,xE) = 0, so that Y is invariant. 

If Ey is a bounded set of positive measure in T, then the fact 
that p(yFo,Fo) = 0 for every y in Y implies that yEyo N Ey ¥ 0. 
It follows that y e EyEo~! and hence that Y is contained in the 
bounded set EyEy~'. To prove, finally, that Y is closed (and 
therefore compact) we observe that 


Y = f\ece{y: o(yE,E) = 0}; 
the desired result follows from 61.A. J 


Theorem E. Jf E is any Baire set in X, then there exists a 
compact, invariant Baire subgroup Y of X such that E is a 
union of cosets of Y. 


Proof. Let {C;} be a sequence of compact Baire sets, of which 
at least one has positive measure, such that FE eS({C,}). For 
each i, let {/;;} be a decreasing sequence of functions in £4(X) 
such that lim; /;;(«) = x¢,(x) for every x in X. For each positive 
rational number r, the set {x:f;;(*) 2 r} is a compact Baire set; 
let T be the invariant o-ring generated by the class of all sets of 
this form. It follows from Theorem B that the metric space of 
all sets of finite measure in T is separable; let { Z,} be a sequence 
which is dense in this metric space. If 


Y = f)r-1 {y: P(yEn,En) = O}, 
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then it follows from Theorem D that Y is a compact invariant 
subgroup of X and it follows from 61.A that Y is a Baire set. 

Since each /;; 1s measurable (T), Theorem C implies that 
fii(y x) = f(x) for every y in Y and every x in X. It follows 
that x¢,(y~'x) = xe,(x), ie. that yC; = C,;, for every y in Y 
and every i = 1, 2, ---. Since, for each y in Y, the class of all 
those sets F for which yF = F is a o-ring, it follows that yE = E 
for every y in Y. Hence E = YE = U..2 Ys; that is, E isa 
union of cosets of the invariant subgroup Y. Jj 


Theorem F. [f {e} is a Baire set, then X is separable. 


Proof. Let {U,} be a sequence of bounded open sets such that 
{e} = ()%.1 Un; we have seen before that there is no loss of 
generality in assuming that 


Cag C Un, mn =1,2,-°-. 
There exists a sequence {C;} of compact sets such that X = 
U?-1 Ci; since each C; is compact there exists, for each i and 2, 
a finite subset {x,;°"} of C; such that C; C Uj «i;5Un. We shall 
prove that the countable class {x,;‘"U,,} is a base. 
We prove first that if U is any neighborhood of e, then there 
exists a positive integer 7 such that ee U, C U. Indeed since 


{e} a N, Un = ft), U, and eeU, 


it follows that a 
1, (U, — U) = (f), U,) — U = 0. 


Since {0,, — U} is a decreasing sequence of compact sets with an 
empty intersection, it follows that U, — U(c U, — U) is empty 
for at least one value of 7. 

Suppose now that « is any element of X and V is any neighbor- 
hood of x. Since «7'V is a neighborhood of e, there exists a 
neighborhood U of e such that U"'U c x'V, and, by the 
preceding paragraph, there is a positive integer ” such that 
eeU, CU. Since x e U1 C;, there is a value of i such that 
x eC; and therefore there is a value of 7 such that x ex;;‘"U,. 
Since the last written relation implies that x, © xU,~', we have 


pee UU, CxUe UU, C20 VU ew F=f, J 
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Theorems E and F together yield the following startling and 
useful result. 


Theorem G. If E is any Baire set in X, then there exists a 
compact, invariant subgroup Y of X such that E is a union of 
cosets of Y, and such that the quotient group X/Y is separable. 


Proof. By Theorem E, there exists a compact, invariant Baire 
subgroup Y such that E is a union of cosets of Y. If {U,} is a 
sequence of open sets such that Y = ()7., U,, then, for each 
positive integer 7, there exists an open set U, in the quotient 


group X = X/Y such that 
You "(Un) Cc Un 


where 7 is the projection from X onto X; (cf. 63.A). It follows 
that Y = ()-17—(U,) and hence that {¢} = ()2_,U,; the 
separability of X is now implied by Theorem F. § 


Theorem H. Every Haar measure in X is completion 
regular. 


Proof. It is sufficient to prove that if U is any bounded open 
set, then there exists a Baire set E contained in U such that 
U — E may be covered by a Baire set of measure zero. Given 
U, we select the Baire set E (C U) so that u(£) is maximal; by 
Theorem G there exists a compact invariant subgroup Y of X 
such that F is a union of cosets of Y and such that the quotient 
group X(= X/Y) is separable. 

Let + be the projection from X onto X, and write 
F = r7'a(U — E); we shall prove that F is a Baire set of meas- 
ure zero. The fact that E is a union of cosets of Y implies that 
n(U — E) = x(U) — z(E); since 1(U) is an open set in a separa- 
ble space, r(U) is a Baire set in X; (cf. 50.E). It follows from the 
relation F = r—!x(U) — E that F is indeed a Baire set. 

Since the Baire open sets of X form a base, corresponding to 
each point x in U — E there is a Baire open set /’(«) such that 
xeV(x) CU. Since {r(V(x)): « e U — E} is an open covering 
of x(U — E), it follows from the separability of X that there 
exists a sequence {x,} of points in U — E such that 


mU — E) C Us (Vs). 
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Since r(U — E) = x(U) — x(E), we have 


mMU-—E)c (U: a(V(«;))) — (EZ) = Uir(V(«) — 2). 


It follows from these considerations that it is sufficient, in order 
to complete the proof of the theorem, to prove that 


ar'(r(V — E))) = 0 


for every Baire open set V contained in U; we turn therefore to 
the proof of this result. (Observe that the reasoning, used above 
to show that r~'w(U — E) is a Baire set, may also be applied to 
V in place of U.) 

If V is a Baire open set contained in U, then it follows from the 
maximal property of E that u(V — EZ) =0. If » is a Haar 
measure in Y such that »(Y) = 1, and if we write @ = ur! and 
g(x) =» I(V — E) NY), then (63.G) there exists a (non 
negative) Baire measurable function ¢ on X such that g = gr 
and such that 


0 = WV — E) = fgdn = 


= fede | vor - BN Y)du(x) 2 0. 
- We have 
x1V-E)NY=(@7V NY) —-@ICENY). 


If xe V, then eex VN Y and if x e’ E, then x 7E N Y =0, 
so that ifx eV — EF, then x '(V — E) N Yis anon empty open 
subset of Y. It follows that if x ex—'1(V — E), so that r(x) = 
a(xo) for some xp in V — E, then 


g(x) = £(r(x)) = &Gr(x0)) = g(%o) > 9, 
and therefore, by 25.D, u(r x(V — Z)) =0. § 


TheoremI. Jf X is an arbitrary (not necessarily o—compact) 
locally compact topological group, and if w is a left invariant 
Borel measure in X, then y is completion regular. 
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Proof. Given any Borel set E in X, there exists a o~compact 
full subgroup Z of X such that E c Z. By Theorem H, y» on Z 
is completion regular and therefore there exist two sets 4 and B 
in Z which are Baire subsets of Z and for which 


ACECB and w(B— A =0. 


Since Z is both open and closed in X, 4 and B are also Baire 
subsets of X. 
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Cavalieri’s principle, 149 
Center, 5 
Characteristic function, 15 
Class, 10 
Closed sets, 3 
Closure, 3 
Coefficient of correlation, 196 
Collection, 10 
Compact sets, 4 
Complement, 16 
Complete: 
Boolean ring, 169 
measure, 31 
Completely regular spaces, 5 
Completion: 
of a measure, 55 
of a topological group, 8 
regular measure, 230 
Complex measure, 120 
Conditional: 
expectation, 209 
probability, 195, 207 
probability as a measure, 210 
Content, 231 
Continuity and additivity: 
of infinite valued set functions, 
40 
of set functions on rings, 39 
of set functions on semirings, 


Continuity from above and below, 


Continuous transformations, 5 
Convergence: 
a.e., 86 
a.e. and in measure, 89, 90 
in measure, 91 
in the mean, 103 
of a series of sets. 19 


INDEX 


Convergence (Cont.): 

of sequences of measures, 170 
Convex metric spaces, 169 
Convolution, 269 
Coset, 6 
Countably: 

additive, 30 

subadditive, 41 
Cylinder, 29, 155 


Decreasing sequences: 
of partitions, 171, 
of sets, 16 
Dense: 
sequences of partitions, 171 
sets, 3 
Density theorem, 268 
Derivatives of set functions, 133 
Difference: 
of two sets, 17 
set, 68 
Dimension, 152 
Discontinuity from above of regu- 
lar outer measures, 53 
Discrete topological space, 3 
Disjoint, 15 
sequences of sets, 38 
Distance between integrable func- 
tions, 98 
Distribution function, 80 
Domain, 161 
Double integral, 146 


Egoroff’s theorem, 88 
on a set of infinite measure, 90 
Elementary function, 86 
Empty set, 10 
Entire space, 9 
Equal sets, 10 
Equicontinuous, 108 
Equivalent: 
sequences of functions, 201 
signed measures, 126 
Essentially bounded functions, 86 


Essential supremum, 86 

Exhaustion, 76 

Extended real number, 2 

Extension of measures, 54 
to larger o-rings, 71 


Fatou’s lemma, 113 
Finite: 


and totally finite measure spaces, 


intersection property, 4 

measure, 31 
Finitely: 

additive, 30 

subadditive, 41 
Fubini’s theorem, 148 
Full: 

sets, 52, 132 

subgroups, 250 
Fundamental: 

in measure, 91 

in the mean, 99 

sequence, 87 


Generated: 
hereditary class, 41 
invariant o-ring, 283 
monotone class, 27 
ring, 22 
o-ring, 24 

Graph, 143 

Group, 6 


Haar measure, 251 
Hahn decomposition, 121 
Hamel basis, 277 
Hausdorff: 

measure, 53 

space, 4 
Hereditary class, 41 
Hélder’s inequality, 175 
Homeomorphism, 5 
Homomorphisn, 6 
Horizontal line, 131 


INDEX 


Identity, 6 
Image, 161 


Increasing sequence of sets, 16 


Indefinite integral, 97 
Independent: 
functions, 192 
sets, 191 
Induced: 
Borel measure, 234 
inner content, 232 
measure, 47 
outer measure, 42, 233 


Inequivalence of two definitions of 
absolute continuity, 128 


Inf, 12 
Inferior limit, 16 
Infimum, 1 
Inner: 
content, 232 
measure, 58 
regular content, 239 
regular set, 224 
Integrable: 
functions, 102 
simple functions, 95 
Integral, 95, 102 
Integration by parts, 269 
Interior, 3 
Intersection, 13 
Into, 161 
Invariant: 
sets, 29 
o-rings, 283 
subgroups, 6 
Inverse, 6 
Inverse image, 76, 161 
Isomorphism, 167 
Iterated integral, 146 


Jacobian, 164 
J-cylinder, 155 
Join, 14 
Jordan decomposition, 123 
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Kolmogoroff’s inequality, 196 


Lattice of sets, 25 
Lebesgue: 
decomposition, 134 
integral, 106 
measurable function, 78 
measurable set, 62 
measure, 62, 153 
-Stieltjes measure, 67 
Left: 
Haar measure, 252 
invariance, 252 
translation, 6 
Lim inf, limit, lim sup, 16 
Linear functional: 
on £&, 243 
on £o, 178 
Linearly independent sets, 277 
Locally: 
bounded, 8 
compact, 4 
Lower: 
ordinate set, 142 
variation, 122 
Lusin’s theorem, 243 


Mean: 
convergence, 103 
fundamental, 99 
value theorem, 114 
Measurability preserving trans- 
formation, 164 
Measurable: 
cover, 50 
function, 77 
function of a measurable func- 
tion, 83 
group, 257 
kernel, 59 
rectangle, 140, 154 
set, 73 
set which is not a Borel set, 67, 
83 


Measurable (Cont.): 
space, 73 
transformation, 162 
Measure, 30 
algebra, 167 
on intervals, 35 
preserving transformation, 164 
ring, 167 
in metric spaces, 40 
space, 73 
Meet, 14 
Metric: 
outer measures, 48 
spaces, 5 
Minkowski’s inequality, 176 
Modulo, 127 
Monotone: 
class, 27 
class generated by a ring, 27 
functions of a real variable, 179 
sequences, 16 
set functions, 37 
Multiplication theorem, 195 
Mutually singular, 126 
p*-measurable sets, 44 
u*-partition, 48 
u-partition, 31 


Negative: 
part, 82 
sets, 120 
Neighborhood, 3 
Non atomic, 168 
Non coincidence of complete o-ring 
and o-ring of »*-measurable 
sets, 58 
Non measurable sets, 69 
Non product measures in product 
spaces, 214 
Non regular: 
measures, 231 
outer measures, 52, 53, 72 
Non term by term integrability, 
111, 112 


INDEX 


Non uniqueness of extension, 57 
Normal class, 28 

Normalized, 171 

Normal numbers, 206 

Norn, 171 


One-point compactification, 4 
One to one, 161 
measurable transformation 
which is not measurability 
preserving, 165 
Onto, 161 
Open: 
covering, 4 
set, 3 
transformation, 5 
Outer: 
measure, 42 
measures on metric spaces, 48 
regular sets, 224 


Partition, 31, 47, 171 
Point, 9 
at infinity, 240 
Positive: 
linear functional, 243 
measure, 166 
part, 82 
sets, 120 
Principle of duality, 17 
Probability measures and spaces, 
191 
Product: 
measures, 145 
of a sequence of measures, 157 
of partitions, 32, 48 
of transformations, 161 
Projection, 6 
Proper difference, 17 
Purely atomic, 182 


Quotient group, 6 


Rademacher functions, 195 
Radius, 5 
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Radon-Nikodym theorem, 128 
counter examples to generaliza- 
tions, 131 
Range, 161 
Rectangle, 137, 150, 154 
Regular: 
contents, 237 
measures, 224 
outer measures, 52 
sets, 224 
Relative: 
complement, 17 
topology, 3 
Relatively invariant measure, 265 
Residual set of measure zero, 66 
Right: 
Haar measure, 252 
translation, 6 
Ring: 
generated by a lattice, 26 
of sets, 19 


Same distribution, 202 
Section, 141 
Semiclosed interval, 32 
Semiring, 22 
Separable: 
measure space, 168 
topological space, 3 
Separated measurable group, 273 
Set, 9 
Set function, 30 
Sides of a rectangle, 137 
Signed measure, 118 
o—algebra, 28 
o—bounded, 4 
o—compact, 4 
o—finite measure, 31 
which is infinite on every inter. 
val, 183 
o-ring, 24 
Simple function, 84 
Singular, 126 
Space, 9 
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Sphere, 5 Topological (Cont.): 
Standard deviation, 196 space, 3 
Stone’s theorem, 170 Topology: 
Strong law of large numbers, 204, of a metric space, 5 

205 of the real line, 3 
Subadditive, 41 Totally finite and o-finite, 31 
Subbase, 3 Total variation, 122 
Subgroup, 6 Transfinite generation of o-rings, 
Sum of regular outer measures, 53 
Subpartition, 32, 48 Transformation, 161 
Subspace, 3 
Sabeenctive: 37 Uncorrelated, 196 
Sup, 12 Uniform: 


absolute continuity, 100 
continuity, 7 
convergence a.e., 87 


Superior limit, 16 
Supremum, 1 


Symmetric: : 

difference, 18 poe 1] 

ighborhood, 7 BPpere 

Reed ee ordinate set, 142 
Tchebycheff’s inequality, 200 variation, 122 
aie “3 Variance, 194 

sabe oups, 275, 276 Vertical line, 131 
Three series theorem, 199 Weak law of large numbers, 201 
Topological: Weil topology, 273 

group, 6 Whole space, 9 


group with different left and 
right Haar measures, 256 Zero~one law, 201 


